Computational Dynamics 

--Dynamic Systems, Molecular 
Dynamics, Computational Sciences, 
Quantum Dynamics and Complex 
Systems Dynamics 



PDF generated using the open source mwlib toolkit. See http://code.pediapress.com/ for more information. 

PDF generated at: Thu, 04 Jun 2009 21:04:23 UTC 



Dynamic system 



Dynamics Systems and Molecular 

Dynamics 



Dynamic system 



1. REDIRECT Dynamicalsystem 

Dynamical systems theory 

Dynamical systems theory is an area of applied mathematics used to describe the 
behavior of complex dynamical systems, usually by employing differential equations or 
difference equations. When differential equations are employed, the theory is called 
continuous dynamical systems. When difference equations are employed, the theory is 
called discrete dynamical systems. When the time variable runs over a set which is discrete 
over some intervals and continuous over other intervals or is any arbitrary time-set such as 
a cantor set then one gets dynamic equations on time scales. Some situations may also be 
modelled by mixed operators such as differential-difference equations. 

This theory deals with the long-term qualitative behavior of dynamical systems, and the 
studies of the solutions to the equations of motion of systems that are primarily mechanical 
in nature; although this includes both planetary orbits as well as the behaviour of electronic 
circuits and the solutions to partial differential equations that arise in biology. Much of 
modern research is focused on the study of chaotic systems. 

This field of study is also called just Dynamical systems, Systems theory or longer as 
Mathematical Dynamical Systems Theory and the Mathematical theory of dynamical 
systems. 
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Overview 

Dynamical systems theory and chaos theory 
deal with the long-term qualitative behavior 
of dynamical systems. Here, the focus is not 
on finding precise solutions to the equations 
defining the dynamical system (which is often 
hopeless), but rather to answer questions like 
"Will the system settle down to a steady state 
in the long term, and if so, what are the 
possible steady states?", or "Does the 
long-term behavior of the system depend on 
its initial condition?" 

An important goal is to describe the fixed 
points, or steady states of a given dynamical 
system; these are values of the variable 
which won't change over time. Some of these 
fixed points are attractive, meaning that if 
the system starts out in a nearby state, it will converge towards the fixed point. 

Similarly, one is interested in periodic points, states of the system which repeat themselves 
after several timesteps. Periodic points can also be attractive. Sarkovskii's theorem is an 
interesting statement about the number of periodic points of a one-dimensional discrete 
dynamical system. 

Even simple nonlinear dynamical systems often exhibit almost random, completely 
unpredictable behavior that has been called chaos. The branch of dynamical systems which 
deals with the clean definition and investigation of chaos is called chaos theory. 




The Lorenz attractor is an example of a non-linear 

dynamical system. Studying this system helped give 

rise to Chaos theory. 



History 

The concept of dynamical systems theory has its origins in Newtonian mechanics. There, as 
in other natural sciences and engineering disciplines, the evolution rule of dynamical 
systems is given implicitly by a relation that gives the state of the system only a short time 
into the future. 

Before the advent of fast computing machines, solving a dynamical system required 
sophisticated mathematical techniques and could only be accomplished for a small class of 
dynamical systems. 

Some excellent presentations of mathematical dynamic system theory include Beltrami 
(1987), Luenberger (1979), Padula and Arbib (1974), and Strogatz (1994). [1] 
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Concepts 

Dynamical systems 

The dynamical system concept is a mathematical formalization for any fixed "rule" which 
describes the time dependence of a point's position in its ambient space. Examples include 
the -» mathematical models that describe the swinging of a clock pendulum, the flow of 
water in a pipe, and the number of fish each spring in a lake. 

A dynamical system has a state determined by a collection of real numbers, or more 
generally by a set of points in an appropriate state space. Small changes in the state of the 
system correspond to small changes in the numbers. The numbers are also the coordinates 
of a geometrical space— a manifold. The evolution rule of the dynamical system is a fixed 
rule that describes what future states follow from the current state. The rule is 
deterministic: for a given time interval only one future state follows from the current state. 

Dynamicism 

Dynamicism, also termed the dynamic hypothesis or the dynamic hypothesis in cognitive 
science or dynamic cognition, is a new approach in cognitive science exemplified by the 
work of philosopher Tim van Gelder. It argues that differential equations are more suited to 
modelling cognition than more traditional computer models. 

Nonlinear system 

In mathematics, a nonlinear system is a system which is not linear, i.e. a system which does 
not satisfy the superposition principle. Less technically, a nonlinear system is any problem 
where the variable(s) to be solved for cannot be written as a linear sum of independent 
components. A nonhomogenous system, which is linear apart from the presence of a 
function of the independent variables, is nonlinear according to a strict definition, but such 
systems are usually studied alongside linear systems, because they can be transformed to a 
linear system as long as a particular solution is known. 

Related fields 

Arithmetic dynamics 

Arithmetic dynamics is a relatively new field that amalgamates two areas of 
mathematics, dynamical systems and number theory. Classically, discrete dynamics 
refers to the study of the iteration of self-maps of the complex plane or real line. 
Arithmetic dynamics is the study of the number-theoretic properties of integer, 
rational, p-adic, and/or algebraic points under repeated application of a polynomial or 
rational function. 

Chaos theory 

Chaos theory describes the behavior of certain dynamical systems - that is, systems 
whose state evolves with time - that may exhibit dynamics that are highly sensitive to 
initial conditions (popularly referred to as the butterfly effect). As a result of this 
sensitivity, which manifests itself as an exponential growth of perturbations in the 
initial conditions, the behavior of chaotic systems appears to be random. This happens 
even though these systems are deterministic, meaning that their future dynamics are 
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fully defined by their initial conditions, with no random elements involved. This 
behavior is known as deterministic chaos, or simply chaos. 

Complex systems 

Complex systems is a scientific field, which studies the common properties of systems 
considered complex in nature, society and science. It is also called complex systems 
theory, complexity science, study of complex systems and/or sciences of complexity. 
The key problems of such systems are difficulties with their formal modeling and 
simulation. From such perspective, in different research contexts complex systems are 
defined on the base of their different attributes. 

The study of complex systems is bringing new vitality to many areas of science where a 
more typical reductionist strategy has fallen short. Complex systems is therefore often 
used as a broad term encompassing a research approach to problems in many diverse 
disciplines including neurosciences, social sciences, meteorology, chemistry, physics, 
computer science, psychology, artificial life, evolutionary computation, economics, 
earthquake prediction, molecular biology and inquiries into the nature of living cells 
themselves. 

Control theory 

Control theory is an interdisciplinary branch of engineering and mathematics, that 
deals with influencing the behavior of dynamical systems. 

Ergodic theory 

Ergodic theory is a branch of mathematics that studies dynamical systems with an 
invariant measure and related problems. Its initial development was motivated by 
problems of -> statistical physics. 

Functional analysis 

Functional analysis is the branch of mathematics, and specifically of analysis, 
concerned with the study of vector spaces and operators acting upon them. It has its 
historical roots in the study of functional spaces, in particular transformations of 
functions, such as the Fourier transform, as well as in the study of differential and 
integral equations. This usage of the word functional goes back to the calculus of 
variations, implying a function whose argument is a function. Its use in general has 
been attributed to mathematician and physicist Vito Volterra and its founding is 
largely attributed to mathematician Stefan Banach. 
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Graph dynamical systems 

The concept of graph dynamical systems (GDS) can be used to capture a wide range of 
processes taking place on graphs or networks. A major theme in the mathematical and 
computational analysis of GDS is to relate their structural properties (e.g. the network 
connectivity) and the global dynamics that result. 

Projected dynamical systems 

Projected dynamical systems is a mathematical theory investigating the behaviour of 
dynamical systems where solutions are restricted to a constraint set. The discipline 
shares connections to and applications with both the static world of optimization and 
equilibrium problems and the dynamical world of ordinary differential equations. A 
projected dynamical system is given by the flow to the projected differential equation. 

Symbolic dynamics 

Symbolic dynamics is the practice of modelling a topological or smooth dynamical 
system by a discrete space consisting of infinite sequences of abstract symbols, each of 
which corresponds to a state of the system, with the dynamics (evolution) given by the 
shift operator. 

System dynamics 

System dynamics is an approach to understanding the behaviour of complex systems 
over time. It deals with internal feedback loops and time delays that affect the 
behaviour of the entire system. What makes using system dynamics different from 
other approaches to studying complex systems is the use of feedback loops and stocks 
and flows. These elements help describe how even seemingly simple systems display 
baffling nonlinearity. 

Topological dynamics 

-» Topological dynamics is a branch of the theory of dynamical systems in which 
qualitative, asymptotic properties of dynamical systems are studied from the viewpoint 
of general topology. 

Applications 

In biomechanics 

In sports biomechanics, dynamical systems theory has emerged in the movement sciences 
as a viable framework for modeling athletic performance. From a dynamical systems 
perspective, the human movement system is a highly intricate network of co-dependent 
sub-systems (e.g. respiratory, circulatory, nervous, skeletomuscular, perceptual) that are 
composed of a large number of interacting components (e.g. blood cells, oxygen molecules, 
muscle tissue, metabolic enzymes, connective tissue and bone). In dynamical systems 
theory, movement patterns emerge through generic processes of self-organization found in 
physical and biological systems. 
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In cognitive science 

Dynamical system theory has recently emerged in the field of cognitive development. It is 
the belief that cognitive development is best represented by physical theories rather than 
theories based on syntax and AI. It also believes that differential equations are the most 
appropriate tool for modeling human behavior. These equations are interpreted to 
represent an agent's cognitive trajectory through state space. In other words, dynamicists 
argue that psychology should be (or is) the description (via differential equations) of the 
cognitions and behaviors of an agent under certain environmental and internal pressures. 
The language of chaos theory is also frequently adopted. 

In it, the learner's mind reaches a state of disequilibrium where old patterns have broken 
down. This is the phase transition of cognitive development. Self organization (the 
spontaneous creation of coherent forms) sets in as activity levels link to each other. Newly 
formed macroscopic and microscopic structures support each other, speeding up the 
process. These links form the structure of a new state of order in the mind through a 
process called scalloping (the repeated building up and collapsing of complex 
performance.) This new, novel state is progressive, discrete, idiosyncratic and 
unpredictable. 

Dynamic systems theory has recently been used to explain a long-unanswered problem in 
child development referred to as the A-not-B error. 

See also 

Related subjects 
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Caratheodory's theorem (conformal 
mapping) 

See also Caratheodory's theorem for other meanings. 

In mathematics, Caratheodory's theorem in complex 
analysis states that if U is a simply connected open 
subset of the complex plane C, whose boundary is a 
Jordan curve V then the Riemann map 

f:U->D 

from U to the unit disk D extends continuously to the 
boundary, qivinq a homeomorphism 

F -. T-^S 1 

from r to the unit circle S . 

Such a reqion is called a Jordan domain. Equivalently, this theorem states that for such sets 
U there is a homeomorphism 

F : cl(U) -» cl(D) 
from the closure of U to the closed unit disk cl(D) whose restriction to the interior is a 
Riemann map, i.e. it is a bijective holomorphic conformal map. 

Another standard formulation of Caratheodory's theorem states that for any pair of simply 
connected open sets U and V bounded by Jordan curves l~ and V , a conformal map 

f:U-*V 

extends to a homeomorphism 

F-. r, -» r 2 . 

This version can be derived from the one stated above by composinq the inverse of one 
Riemann map with the other. 

A more qeneral version of Caratheodory's theorem is the followinq. Let 

g-.D^U 

be the inverse of the Riemann map, where D C C is the unit disk, and U C C is a simply 
connected domain. Then g extends continuously to 

G : cl(D) -» cl(L0 
if and only if the boundary of U is locally connected. 
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Context 

Intuitively, Caratheodory's theorem says that compared to general simply connected open 
sets in the complex plane C, those bounded by Jordan curves are particularly well-behaved. 

Caratheodory's theorem is a basic result in the study of boundary behavior of conformal 
maps, a classical part of complex analysis. In general it is very difficult to decide whether or 
not the Riemann map from an open set U to the unit disk D extends continuously to the 
boundary, and how and why it may fail to do so at certain points. 

While having a Jordan curve boundary is sufficient for such an extension to exist, it is by no 
means necessary. For example, the map 

m = z 2 

from the upper half-plane H to the open set G that is the complement of the positive real 
axis is holomorphic and conformal, and it extends to a continuous map from the real line R 
to the positive real axis R + ; however, the set G is not bounded by a Jordan curve. 

Dynamics of Markovian particles 

Dynamics of Markovian particles (or DMP) is the basis of a theory for kinetics of 
particles in open heterogeneous systems. It can be looked upon as an application of the 
notion of stochastic process conceived as a physical entity; e.g. the particle moves because 
there is a transition probability acting on it. 

Two particular features of DMP might be noticed: (1) an ergodic like relation between the 
motion of particle and the corresponding steady state, and (2) the classic notion of 
geometric volume appears nowhere (e.g. a concept such as flow of "substance" is not 
expressed as liters per time unit but as number of particles per time unit). Though being 
primitive DMP has been applied for solving a classic paradox of the absorption of mercury 
by fish and by mollusks. The theory has also been applied for a purely probabilistic 
derivation of the fundamental physical principle: conservation of mass; this might be looked 
upon as a contribution to the old and ongoing discussion of the relation between physics 
and probability theory. 

Sources 

• Bergner— DMP, a kinetics of macroscopic particles in open heterogeneous systems 

References 

[1] http://www.bergner.se/DMP/download.htm 
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Statistical physics 

Statistical physics is the area of physics that uses methods of probability theory and 
statistics, and particularly the mathematical tools for dealing with large populations, in 
solving physical problems. It can describe a wide variety of fields with an inherently 
stochastic nature. Examples include problems involving nuclear reactions, and topics in the 
fields of biology, chemistry, neurology and even some social sciences such as sociology. 

Historically, one of the first topics in physics where statistical methods were applied was 
the field of mechanics, which is concerned with the motion of particles or objects when 
subjected to a force. -» Statistical mechanics provides a framework for relating the 
microscopic properties of individual atoms and molecules to the macroscopic or bulk 
properties of materials that can be observed in everyday life, therefore explaining 
thermodynamics as a natural result of statistics and mechanics (classical and quantum) at 
the microscopic level. Because of this history, the term "statistical physics" is therefore 
sometimes used as a synonym for statistical mechanics or statistical thermodynamics, 
rather than in the wider sense considered in this article. 

A statistical approach can work well in classical systems when the number of degrees of 
freedom (and so the number of variables) is so large that exact solution is not possible, or 
not really useful. Statistical mechanics can also describe work in non-linear dynamics, 
chaos theory, thermal physics, fluid dynamics (particularly at high Knudsen numbers), or 
plasma physics. 

Although some problems in statistical physics can be solved analytically using 
approximations and expansions, most current research utilizes the large processing power 
of modern computers to simulate or approximate solutions. A common approach to 
statistical problems is to use a Monte Carlo simulation to yield insight into the dynamics of 
a complex system. 

See also 

Statistical ensemble 

Statistical field theory 

Mean sojourn time 

-» Dynamics of Markovian particles 

Complex network 

Mathematical physics 

Combinatorics and physics 
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Statistical mechanics 

Statistical mechanics (or statistical thermodynamics ) is the application of 
probability theory, which includes mathematical tools for dealing with large populations, to 
the field of mechanics, which is concerned with the motion of particles or objects when 
subjected to a force. It provides a framework for relating the microscopic properties of 
individual atoms and molecules to the macroscopic or bulk properties of materials that can 
be observed in everyday life, therefore explaining thermodynamics as a natural result of 
statistics and mechanics (classical and quantum) at the microscopic level. 

It provides a molecular-level interpretation of thermodynamic quantities such as work, 
heat, free energy, and entropy, allowing the thermodynamic properties of bulk materials to 
be related to the spectroscopic data of individual molecules. This ability to make 
macroscopic predictions based on microscopic properties is the main advantage of 
statistical mechanics over classical thermodynamics. Both theories are governed by the 
second law of thermodynamics through the medium of entropy. However, entropy in 
thermodynamics can only be known empirically, whereas in statistical mechanics, it is a 
function of the distribution of the system on its micro-states. 

Statistical thermodynamics was born in 1870 with the work of Austrian physicist Ludwig 
Boltzmann, much of which was collectively published in Boltzmann's 1896 Lectures on Gas 
Theory} ' Boltzmann's original papers on the statistical interpretation of thermodynamics, 
the H-theorem, transport theory, thermal equilibrium, the equation of state of gases, and 
similar subjects, occupy about 2,000 pages in the proceedings of the Vienna Academy and 
other societies. The term "statistical thermodynamics" was proposed for use by the 
American thermodynamicist and physical chemist J. Willard Gibbs in 1902. According to 
Gibbs, the term "statistical", in the context of mechanics, i.e. statistical mechanics, was first 
used by the Scottish physicist James Clerk Maxwell in 1871. 

Overview 

The essential problem in statistical thermodynamics is to determine the distribution of a 
given amount of energy E over N identical systems. The goal of statistical 
thermodynamics is to understand and to interpret the measurable macroscopic properties 
of materials in terms of the properties of their constituent particles and the interactions 
between them. This is done by connecting thermodynamic functions to quantum-mechanic 
equations. Two central quantities in statistical thermodynamics are the Boltzmann factor 
and the partition function. 
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Fundamentals 

Central topics covered in statistical thermodynamics include: 

Microstates and configurations 

Boltzmann distribution law 

Partition function, Configuration integral or configurational partition function 

Thermodynamic equilibrium - thermal, mechanical, and chemical. 

Internal degrees of freedom - rotation, vibration, electronic excitation, etc. 

Heat capacity - Einstein solids, polyatomic gases, etc. 

Nernst heat theorem 

Fluctuations 

Gibbs paradox 

Degeneracy 

Lastly, and most importantly, the formal definition of entropy of a thermodynamic system 
from a statistical perspective is called statistical entropy, and is defined as: 

S = k B lnn 
where 

k D is Boltzmann's constant 1.38066xl0 -23 J K _1 and 

B J 

ft is the number of microstates corresponding to the observed thermodynamic 

macrostate. 
A common mistake is taking this formula as a hard general definition of entropy. This 
equation is valid only if each microstate is equally accessible (each microstate has an equal 
probability of occurring). 

Boltzmann Distribution 

If the system is large the Boltzmann distribution could be used (the Boltzmann distribution 
is an approximate result) 

71; <X e k B T . 



n. 
This can now be used with Pi 



N' 



n, 



N yiEll levels -i^r 

History 

In 1738, Swiss physicist and mathematician Daniel Bernoulli published Hydrodynamica 
which laid the basis for the kinetic theory of gases. In this work, Bernoulli positioned the 
argument, still used to this day, that gases consist of great numbers of molecules moving in 
all directions, that their impact on a surface causes the gas pressure that we feel, and that 
what we experience as heat is simply the kinetic energy of their motion. 

In 1859, after reading a paper on the diffusion of molecules by Rudolf Clausius, Scottish 
physicist James Clerk Maxwell formulated the Maxwell distribution of molecular velocities, 
which gave the proportion of molecules having a certain velocity in a specific range. This 
was the first-ever statistical law in physics. Five years later, in 1864, Ludwig Boltzmann, 
a young student in Vienna, came across Maxwell's paper and was so inspired by it that he 
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spent much of his long and distinguished life developing the subject further. 

Hence, the foundations of statistical thermodynamics were laid down in the late 1800s by 
those such as Maxwell, Ludwig Boltzmann, Max Planck, Rudolf Clausius, and Willard Gibbs 
who began to apply statistical and quantum atomic theory to ideal gas bodies. 
Predominantly, however, it was Maxwell and Boltzmann, working independently, who 
reached similar conclusions as to the statistical nature of gaseous bodies. Yet, one must 
consider Boltzmann to be the "father" of statistical thermodynamics with his 1875 
derivation of the relationship between entropy S and multiplicity O, the number of 
microscopic arrangements (microstates) producing the same macroscopic state 
(macrostate) for a particular system. ] 

Fundamental postulate 

The fundamental postulate in statistical mechanics (also known as the equal a priori 
probability postulate) is the following: 

Given an isolated system in equilibrium, it is found with equal probability in each of its 
accessible microstates. 

This postulate is a fundamental assumption in statistical mechanics - it states that a system 
in equilibrium does not have any preference for any of its available microstates. Given Q 
microstates at a particular energy, the probability of finding the system in a particular 
microstate is p = 1/Q. 

This postulate is necessary because it allows one to conclude that for a system at 
equilibrium, the thermodynamic state (macrostate) which could result from the largest 
number of microstates is also the most probable macrostate of the system. 

The postulate is justified in part, for classical systems, by Liouville's theorem (Hamiltonian), 
which shows that if the distribution of system points through accessible phase space is 
uniform at some time, it remains so at later times. 

Similar justification for a discrete system is provided by the mechanism of detailed balance. 

This allows for the definition of the information function (in the context of information 
theory): 

J = -"^pilnpi = {Inp}. 
i 
When all the probabilities (rhos) are equal, I is maximal, and we have minimal information 

about the system. When our information is maximal (i.e., one rho is equal to one and the 

rest to zero, such that we know what state the system is in), the function is minimal. 

This "information function" is the same as the reduced entropic function in 

thermodynamics. 
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Statistical ensembles 

Microcanonical ensemble 

In microcanonical ensemble N, V and E are fixed. Since the second law of thermodynamics 
applies to isolated systems, the first case investigated will correspond to this case. The 
Microcanonical ensemble describes an isolated system. 

The entropy of such a system can only increase, so that the maximum of its entropy 
corresponds to an equilibrium state for the system. 

Because an isolated system keeps a constant energy, the total energy of the system does 
not fluctuate. Thus, the system can access only those of its micro-states that correspond to 
a given value E of the energy. The internal energy of the system is then strictly equal to its 
energy. 

Let us call Q{E) the number of micro-states corresponding to this value of the system's 
energy. The macroscopic state of maximal entropy for the system is the one in which all 
micro-states are equally likely to occur, with probability l/fi(.E), during the system's 
fluctuations. 

S = 






where 

S is the system entropy, and 

kg is Boltzmann's constant. 

Canonical ensemble 

In canonical ensemble N, V and T are fixed. Invoking the concept of the canonical 
ensemble, it is possible to derive the probability Pi that a macroscopic system in thermal 
equilibrium with its environment, will be in a given microstate with energy E; according to 
the Boltzmann distribution: 



Pi = 



e -m 



where 3 = — , 

The temperature T arises from the fact that the system is in thermal equilibrium with its 
environment. The probabilities of the various microstates must add to one, and the 
normalization factor in the denominator is the canonical partition function: 

J-max 

..■ 
where Et is the energy of the i th microstate of the system. The partition function is a 
measure of the number of states accessible to the system at a given temperature. The 
article canonical ensemble contains a derivation of Boltzmann's factor and the form of the 
partition function from first principles. 

To sum up, the probability of finding a system at temperature T in a particular state with 
energy E { is 
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Thermodynamic Connection 

The partition function can be used to find the expected (average) value of any microscopic 
property of the system, which can then be related to macroscopic variables. For instance, 
the expected value of the microscopic energy E is interpreted as the microscopic definition 
of the thermodynamic variable internal energy U , and can be obtained by taking the 
derivative of the partition function with respect to the temperature. Indeed, 

(E) gjj^!f: 1^ 

V ' Z Zdd 

implies, together with the interpretation of (E) as U , the following microscopic definition 
of internal energy: 

The entropy can be calculated by (see Shannon entropy) 
S e-P E ' 

"- ■' ; £* 

\ \ 

which implies that 

jm = u _ TS = F 

Q 
is the free energy of the system or in other words, 

Z = e-0* 
Having microscopic expressions for the basic thermodynamic potentials U (internal 
energy), S (entropy) and F (free energy) is sufficient to derive expressions for other 
thermodynamic quantities. The basic strategy is as follows. There may be an intensive or 
extensive quantity that enters explicitly in the expression for the microscopic energy Ei , 
for instance magnetic field (intensive) or volume (extensive). Then, the conjugate 
thermodynamic variables are derivatives of the internal energy. The macroscopic 
magnetization (extensive) is the derivative of U with respect to the (intensive) magnetic 
field, and the pressure (intensive) is the derivative of U with respect to volume (extensive). 
The treatment in this section assumes no exchange of matter (i.e. fixed mass and fixed 
particle numbers). However, the volume of the system is variable which means the density 
is also variable. 

This probability can be used to find the average value, which corresponds to the 
macroscopic value, of any property, J , that depends on the energetic state of the system 
by using the formula: 

e~ pEi 

i i 

where {J) is the average value of property J . This equation can be applied to the internal 
energy, U : 

p -pBi 

Subsequently, these equations can be combined with known thermodynamic relationships 
between U and V" to arrive at an expression for pressure in terms of only temperature, 
volume and the partition function. Similar relationships in terms of the partition function 
can be derived for other thermodynamic properties as shown in the following table; see also 
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the detailed explanation in configuration integral 
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Helmholtz free energy: 
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Internal energy: 
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Pressure: 
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Entropy: 


3 = fc(ln Z + BU) 


Gibbs free energy: 


°"+™ --" + *ffl„ 


Enthalpy: 


H = U + PV 


Constant volume heat capacity: 


MIL 


Constant pressure heat capacity: 


*-(S)„ 


Chemical potential: 


1 (d\nZ\ 



To clarify, this is not a grand canonical ensemble. 

It is often useful to consider the energy of a given molecule to be distributed among a 
number of modes. For example, translational energy refers to that portion of energy 
associated with the motion of the center of mass of the molecule. Configurational energy 
refers to that portion of energy associated with the various attractive and repulsive forces 
between molecules in a system. The other modes are all considered to be internal to each 
molecule. They include rotational, vibrational, electronic and nuclear modes. If we assume 
that each mode is independent (a questionable assumption) the total energy can be 
expressed as the sum of each of the components: 

hi = hit -\- h/ c -\- hj n -\- ±L e -\- i% -|- hi v 
Where the subscripts t , c t n, e, r t and u correspond to translational, configurational, 
nuclear, electronic, rotational and vibrational modes, respectively. The relationship in this 
equation can be substituted into the very first equation to give: 
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If we can assume all these modes are completely uncoupled and uncorrelated, so all these 
factors are in a probability sense completely independent, then 

Thus a partition function can be defined for each mode. Simple expressions have been 
derived relating each of the various modes to various measurable molecular properties, 
such as the characteristic rotational or vibrational frequencies. 

Expressions for the various molecular partition functions are shown in the following table. 



Nuclear 
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Electronic 


Z e = Woe K™*+^- B ^ T +- 


Vibrational 


j 


Rotational (linear) 


T 
a 


Rotational (non-linear) 




1 I ttT* 
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Translational 


(2nmkTf^ 


Configurational (ideal gas) 


Z, = V 



These equations can be combined with those in the first table to determine the contribution 
of a particular energy mode to a thermodynamic property. For example the "rotational 
pressure" could be determined in this manner. The total pressure could be found by 
summing the pressure contributions from all of the individual modes, ie: 

P = P t + P c + P n + P e + Pt + P. 



Grand canonical ensemble 

In grand canonical ensemble V, T and chemical potential are fixed. If the system under 
study is an open system, (matter can be exchanged), but particle number is not conserved, 
we would have to introduce chemical potentials, |i., j = l,...,n and replace the canonical 
partition function with the grand canonical partition function: 



H(V,2»=X> X P [p 






where N.. is the number of i 

y 



th 



•th 



species particles in the i configuration. Sometimes, we also 
have other variables to add to the partition function, one corresponding to each conserved 
quantity. Most of them, however, can be safely interpreted as chemical potentials. In most 
condensed matter systems, things are nonrelativistic and mass is conserved. However, most 
condensed matter systems of interest also conserve particle number approximately 
(metastably) and the mass (nonrelativistically) is none other than the sum of the number of 
each type of particle times its mass. Mass is inversely related to density, which is the 
conjugate variable to pressure. For the rest of this article, we will ignore this complication 
and pretend chemical potentials don't matter. See grand canonical ensemble. 

Let's rework everything using a grand canonical ensemble this time. The volume is left 
fixed and does not figure in at all in this treatment. As before, j is the index for those 
particles of species j and i is the index for microstate z: 
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Grand potential: 
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Particle number: 


Al /3 V 9fM ), 


Entropy: 


S = k(\nE + SU -8^2 fiiNi) 

i 


Helmholtz free energy: 
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Equivalence between descriptions at the thermodynamic limit 

All of the above descriptions differ in the way they allow the given system to fluctuate 
between its configurations. 

In the micro-canonical ensemble, the system exchanges no energy with the outside world, 
and is therefore not subject to energy fluctuations; in the canonical ensemble, the system is 
free to exchange energy with the outside in the form of heat. 

In the thermodynamic limit, which is the limit of large systems, fluctuations become 
negligible, so that all these descriptions converge to the same description. In other words, 
the macroscopic behavior of a system does not depend on the particular ensemble used for 
its description. 

Given these considerations, the best ensemble to choose for the calculation of the 
properties of a macroscopic system is that ensemble which allows the result to be derived 
most easily. 

Random walks 

The study of long chain polymers has been a source of problems within the realms of 
statistical mechanics since about the 1950s. One of the reasons however that scientists 
were interested in their study is that the equations governing the behaviour of a polymer 
chain were independent of the chain chemistry. What is more, the governing equation turns 
out to be a random (diffusive) walk in space. Indeed, the Schrodinger equation is itself a 
diffusion equation in imaginary time, t' = it . 

Random walks in time 

The first example of a random walk is one in space, whereby a particle undergoes a random 
motion due to external forces in its surrounding medium. A typical example would be a 
pollen grain in a beaker of water. If one could somehow "dye" the path the pollen grain has 
taken, the path observed is defined as a random walk. 

Consider a toy problem, of a train moving along a ID track in the x-direction. Suppose that 
the train moves either a distance of + or - a fixed distance b, depending on whether a coin 
lands heads or tails when flipped. Lets start by considering the statistics of the steps the toy 
train takes (where Si is the ith step taken): 

(Si) = ; due to a priori equal probabilities 
{SiS 3 } = b 2 6ij. 

The second quantity is known as the correlation function. The delta is the kronecker delta 
which tells us that if the indices i and j are different, then the result is 0, but if i = j then the 
kronecker delta is 1, so the correlation function returns a value of b 2 - This makes sense, 
because if z = j then we are considering the same step. Rather trivially then it can be shown 
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that the average displacement of the train on the x-axis is 0; 

1=1 

AT 



z=l 



X=l 



As stated (Sj) is 0, so the sum of is still 0. It can also be shown, using the same method 
demonstrated above, to calculate the root mean square value of problem. The result of this 
calculation is given below 



■"rras y \ / DVjIi. 

From the diffusion equation it can be shown that the distance a diffusing particle moves in 
a media is proportional to the root of the time the system has been diffusing for, where the 
proportionality constant is the root of the diffusion constant. The above relation, although 
cosmetically different reveals similar physics, where N is simply the number of steps moved 
(is loosely connected with time) and b is the characteristic step length. As a consequence 
we can consider diffusion as a random walk process. 

Random walks in space 

Random walks in space can be thought of as snapshots of the path taken by a random 
walker in time. One such example is the spatial configuration of long chain polymers. 

There are two types of random walk in space: self-avoiding random walks, where the links 
of the polymer chain interact and do not overlap in space, and pure random walks, where 
the links of the polymer chain are non-interacting and links are free to lie on top of one 
another. The former type is most applicable to physical systems, but their solutions are 
harder to get at from first principles. 

By considering a freely jointed, non-interacting polymer chain, the end-to-end vector is 

N 

R = 2_^ r i where r i is the vector position of the z'-th link in the chain. As a result of the 

i=i 
central limit theorem, if N >> 1 then we expect a Gaussian distribution for the end-to-end 
vector. We can also make statements of the statistics of the links themselves; 
{i\} = 0; by the isotropy of space 

{rj ■ Tj) = 3£tJjj; all the links in the chain are uncorrelated with one another 
Using the statistics of the individual links, it is easily shown that {R} = and 
(R ■ R) = 3Nb . Notice this last result is the same as that found for random walks in time. 

Assuming, as stated, that that distribution of end-to-end vectors for a very large number of 
identical polymer chains is gaussian, the probability distribution has the following form 

1 -3R R 

p^Y /g eXP 2JV6 2 

What use is this to us? Recall that according to the principle of equally likely a priori 
probabilities, the number of microstates, Q, at some physical value is directly proportional 
to the probability distribution at that physical value, viz; 

Q (R) = cP (R) 
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where c is an arbitrary proportionality constant. Given our distribution function, there is a 

maxima corresponding to R = . Physically this amounts to there being more microstates 

which have an end-to-end vector of than any other microstate. Now by considering 

S(R)=fe fl lnfi(R) 

AS (R) = S (R) - S (0) 

AF = -TAS (R) 

where F is the Helmholtz free energy it is trivial to show that 

A Hookian spring! 

This result is known as the Entropic Spring Result and amounts to saying that upon 
stretching a polymer chain you are doing work on the system to drag it away from its 
(preferred) equilibrium state. An example of this is a common elastic band, composed of 
long chain (rubber) polymers. By stretching the elastic band you are doing work on the 
system and the band behaves like a conventional spring. What is particularly astonishing 
about this result however, is that the work done in stretching the polymer chain can be 
related entirely to the change in entropy of the system as a result of the stretching. 

Classical thermodynamics vs. statistical thermodynamics 

As an example, from a classical thermodynamics point of view one might ask what is it 
about a thermodynamic system of gas molecules, such as ammonia NH , that determines 
the free energy characteristic of that compound? Classical thermodynamics does not 
provide the answer. If, for example, we were given spectroscopic data, of this body of gas 
molecules, such as bond length, bond angle, bond rotation, and flexibility of the bonds in 
NH we should see that the free energy could not be other than it is. To prove this true, we 
need to bridge the gap between the microscopic realm of atoms and molecules and the 
macroscopic realm of classical thermodynamics. From physics, -» statistical mechanics 
provides such a bridge by teaching us how to conceive of a thermodynamic system as an 
assembly of units. More specifically, it demonstrates how the thermodynamic parameters of 
a system, such as temperature and pressure, are interpretable in terms of the parameters 
descriptive of such constituent atoms and molecules. 

In a bounded system, the crucial characteristic of these microscopic units is that their 
energies are quantized. That is, where the energies accessible to a macroscopic system 
form a virtual continuum of possibilities, the energies open to any of its submicroscopic 
components are limited to a discontinuous set of alternatives associated with integral 
values of some quantum number. 
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Notes 

[1] The terms "Statistical mechanics" and "statistical thermodynamics" are used interchangeably. "Statistical 

physics" is a broader term which includes statistical mechanics, but is sometimes also used as a synonym for 

statistical mechanics 
[2] On history of fundamentals of statistical thermodynamics (http://www.worldscibooks.com/phy_etextbook/ 

2012/2012_chap01.pdf) (section 1.2) 
[3] Schrodinger, Erwin (1946). Statistical Thermodynamics. Dover Publications, Inc.. ISBN 0-486-66101-6. OCLC 

20056858 (http://worldcat.org/oclc/20056858). 
[4] Mahon, Basil (2003). The Man Who Changed Everything - the Life of James Clerk Maxwell. Hoboken, NJ: 

Wiley. ISBN 0-470-86171-1. OCLC 52358254 62045217 (http://worldcat.org/oclc/52358254 + 62045217). 
[5] Perrot, Pierre (1998). AtoZ of Thermodynamics. Oxford University Press. ISBN 0-19-856552-6. OCLC 

123283342 38073404 (http://worldcat.org/oclc/123283342+38073404). 
[6] http://clesm.mae.ufl.edu/wiki.pub/index.php/Configuration_integral_%28statistical_mechanics%29 
[7] Nash, Leonard K. (1974). Elements of Statistical Thermodynamics, 2nd Ed.. Dover Publications, Inc.. ISBN 

0-486-44978-5. OCLC 61513215 (http://worldcat.org/oclc/61513215). 
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Timeline 



Molecular dynamics 



Molecular dynamics (MD) is a form of computer simulation in which atoms and molecules 
are allowed to interact for a period of time by approximations of known physics, giving a 
view of the motion of the atoms. Because molecular systems generally consist of a vast 
number of particles, it is impossible to find the properties of such complex systems 
analytically. When the number of bodies are more than two no analytical solutions can be 
found and result in chaotic motion (see n-body problem). MD simulation circumvents this 
problem by using numerical methods. It represents an interface between laboratory 
experiments and theory, and can be understood as a "virtual experiment". 

Molecular dynamics is that branch of chemistry, which deals with the study of computer 
simulation in which atoms and molecules are allowed to interact for a period of time by 
approximations of known physics. 

MD probes the relationship between molecular structure, movement and function. 
Molecular dynamics is a multidisciplinary method. Its laws and theories stem from 
mathematics, physics, and chemistry, and it employs algorithms from computer science and 
information theory. It was originally conceived within theoretical physics in the late 
1950s and early 1960s , but is applied today mostly in materials science and modeling 
of biomolecules. 

Before it became possible to simulate molecular dynamics with computers, some undertook 
the hard work of trying it with physical models such as macroscopic spheres. The idea was 
to arrange them to replicate the properties of a liquid. J.D. Bernal said, in 1962: "... I took a 
number of rubber balls and stuck them together with rods of a selection of different lengths 
ranging from 2.75 to 4 inches. I tried to do this in the first place as casually as possible, 
working in my own office, being interrupted every five minutes or so and not remembering 
what I had done before the interruption." Fortunately, now computers keep track of 
bonds during a simulation. 

Molecular dynamics is a specialized discipline of molecular modeling and computer 
simulation based on -» statistical mechanics; the main justification of the MD method is that 
statistical ensemble averages are equal to time averages of the system, known as the 
ergodic hypothesis. MD has also been termed "statistical mechanics by numbers" and 
"Laplace's vision of Newtonian mechanics" of predicting the future by animating nature's 
forces and allowing insight into molecular motion on an atomic scale. However, long 

MD simulations are mathematically ill-conditioned, generating cumulative errors in 
numerical integration that can be minimized with proper selection of algorithms and 
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parameters, but not eliminated entirely. Furthermore, current potential functions are, in 
many cases, not sufficiently accurate to reproduce the dynamics of molecular systems, so 
the much more computationally demanding Ab Initio Molecular Dynamics method must be 
used. Nevertheless, molecular dynamics techniques allow detailed time and space 
resolution into representative behavior in phase space. 



Areas of Application 

There is a significant difference 
between the focus and methods 
used by chemists and 
physicists, and this is reflected 
in differences in the jargon 
used by the different fields. In 
chemistry and biophysics, the 
interaction between the 
particles is either described by 
a "force field" (classical MD), 
a -» quantum chemical model, 
or a mix between the two. 
These terms are not used in 
physics, where the interactions 
are usually described by the 
name of the theory or 
approximation being used and 
called the potential energy, or just 
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Highly simplified description of the molecular dynamics simulation 

algorithm. The simulation proceeds iteratively by alternatively 

calculating forces and solving the equations of motion based on the 

accelerations obtained from the new forces. In practise, almost all 

MD codes use much more complicated versions of the algorithm, 

including two steps (predictor and corrector) in solving the equations 

of motion and many additional steps for e.g. temperature and 

pressure control, analysis and output. 



'potential". 



Beginning in theoretical physics, the method of MD gained popularity in materials science 
and since the 1970s also in biochemistry and biophysics. In chemistry, MD serves as an 
important tool in protein structure determination and refinement using experimental tools 
such as X-ray crystallography and NMR. It has also been applied with limited success as a 
method of refining protein structure predictions. In physics, MD is used to examine the 
dynamics of atomic-level phenomena that cannot be observed directly, such as thin film 
growth and ion-subplantation. It is also used to examine the physical properties of 
nanotechnological devices that have not or cannot yet be created. 

In applied mathematics and theoretical physics, molecular dynamics is a part of the 
research realm of dynamical systems, ergodic theory and -» statistical mechanics in 
general. The concepts of energy conservation and molecular entropy come from 
thermodynamics. Some techniques to calculate conformational entropy such as principal 
components analysis come from information theory. Mathematical techniques such as the 
transfer operator become applicable when MD is seen as a Markov chain. Also, there is a 
large community of mathematicians working on volume preserving, symplectic integrators 
for more computationally efficient MD simulations. 

MD can also be seen as a special case of the discrete element method (DEM) in which the 
particles have spherical shape (e.g. with the size of their van der Waals radii.) Some 
authors in the DEM community employ the term MD rather loosely, even when their 
simulations do not model actual molecules. 
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Design Constraints 

Design of a molecular dynamics simulation should account for the available computational 
power. Simulation size (n=number of particles), timestep and total time duration must be 
selected so that the calculation can finish within a reasonable time period. However, the 
simulations should be long enough to be relevant to the time scales of the natural processes 
being studied. To make statistically valid conclusions from the simulations, the time span 
simulated should match the kinetics of the natural process. Otherwise, it is analogous to 
making conclusions about how a human walks from less than one footstep. Most scientific 
publications about the dynamics of proteins and DNA use data from simulations spanning 
nanoseconds (1E-9 s) to microseconds (1E-6 s). To obtain these simulations, several 
CPU-days to CPU-years are needed. Parallel algorithms allow the load to be distributed 
among CPUs; an example is the spatial decomposition in LAMMPS. 

During a classical MD simulation, the most CPU intensive task is the evaluation of the 
potential (force field) as a function of the particles' internal coordinates. Within that energy 
evaluation, the most expensive one is the non-bonded or non-covalent part. In Big O 
notation, common molecular dynamics simulations scale by 0{n ) if all pair- wise 
electrostatic and van der Waals interactions must be accounted for explicitly. This 
computational cost can be reduced by employing electrostatics methods such as Particle 
Mesh Ewald ( 0(nlog(n)) ) or good spherical cutoff techniques ( 0(n) ). 

Another factor that impacts total CPU time required by a simulation is the size of the 
integration timestep. This is the time length between evaluations of the potential. The 
timestep must be chosen small enough to avoid discretization errors (i.e. smaller than the 
fastest vibrational frequency in the system). Typical timesteps for classical MD are in the 
order of 1 femtosecond (1E-15 s). This value may be extended by using algorithms such as 
SHAKE, which fix the vibrations of the fastest atoms (e.g. hydrogens) into place. Multiple 
time scale methods have also been developed, which allow for extended times between 
updates of slower long-range forces. 

For simulating molecules in a solvent, a choice should be made between explicit solvent and 
implicit solvent. Explicit solvent particles (such as the TIP3P and SPC/E water models) must 
be calculated expensively by the force field, while implicit solvents use a mean-field 
approach. Using an explicit solvent is computationally expensive, requiring inclusion of 
about ten times more particles in the simulation. But the granularity and viscosity of 
explicit solvent is essential to reproduce certain properties of the solute molecules. This is 
especially important to reproduce kinetics. 

In all kinds of molecular dynamics simulations, the simulation box size must be large 
enough to avoid boundary condition artifacts. Boundary conditions are often treated by 
choosing fixed values at the edges, or by employing periodic boundary conditions in which 
one side of the simulation loops back to the opposite side, mimicking a bulk phase. 

Microcanonical ensemble (NVE) 

In the microcanonical, or NVE ensemble, the system is isolated from changes in moles 
(N), volume (V) and energy (E). It corresponds to an adiabatic process with no heat 
exchange. A microcanonical molecular dynamics trajectory may be seen as an exchange of 
potential and kinetic energy, with total energy being conserved. For a system of N particles 
with coordinates X and velocities } ' ', the following pair of first order differential equations 
may be written in Newton's notation as 
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F{X) = -VU{X) = MV(t) 

V(t) = X(t). 
The potential energy function U{X) of the system is a function of the particle coordinates 
X . It is referred to simply as the "potential" in Physics, or the "force field" in Chemistry. 
The first equation comes from Newton's laws; the force F acting on each particle in the 
system can be calculated as the negative gradient of U{X) . 

For every timestep, each particle's position X and velocity V may be integrated with a 
symplectic method such as Verlet. The time evolution of X and V is called a trajectory. 
Given the initial positions (e.g. from theoretical knowledge) and velocities (e.g. randomized 
Gaussian), we can calculate all future (or past) positions and velocities. 

One frequent source of confusion is the meaning of temperature in MD. Commonly we have 
experience with macroscopic temperatures, which involve a huge number of particles. But 
temperature is a statistical quantity. If there is a large enough number of atoms, statistical 
temperature can be estimated from the instantaneous temperature, which is found by 
equating the kinetic energy of the system to nk T/2 where n is the number of degrees of 
freedom of the system. 

A temperature-related phenomenon arises due to the small number of atoms that are used 
in MD simulations. For example, consider simulating the growth of a copper film starting 
with a substrate containing 500 atoms and a deposition energy of 100 eV. In the real world, 
the 100 eV from the deposited atom would rapidly be transported through and shared 
among a large number of atoms ( 10 L0 or more) with no big change in temperature. When 
there are only 500 atoms, however, the substrate is almost immediately vaporized by the 
deposition. Something similar happens in biophysical simulations. The temperature of the 
system in NVE is naturally raised when macromolecules such as proteins undergo 
exothermic conformational changes and binding. 

Canonical ensemble (NVT) 

In the canonical ensemble, moles (N), volume (V) and temperature (T) are conserved. It is 
also sometimes called constant temperature molecular dynamics (CTMD). In NVT, the 
energy of endothermic and exothermic processes is exchanged with a thermostat. 

A variety of thermostat methods are available to add and remove energy from the 
boundaries of an MD system in a realistic way, approximating the canonical ensemble. 
Popular techniques to control temperature include the Nose-Hoover thermostat, the 
Berendsen thermostat, and Langevin dynamics. Note that the Berendsen thermostat might 
introduce the flying ice cube effect, which leads to unphysical translations and rotations of 
the simulated system. 
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Isothermal- Isobaric (NPT) ensemble 

In the isothermal-isobaric ensemble, moles (N), pressure (P) and temperature (T) are 
conserved. In addition to a thermostat, a barostat is needed. It corresponds most closely to 
laboratory conditions with a flask open to ambient temperature and pressure. 

In the simulation of biological membranes, isotropic pressure control is not appropriate. 
For lipid bilayers, pressure control occurs under constant membrane area (NPAT) or 
constant surface tension "gamma" (NPyT). 

Generalized ensembles 

The replica exchange method is a generalized ensemble. It was originally created to deal 
with the slow dynamics of disordered spin systems. It is also called parallel tempering. The 

rq] 

replica exchange MD (REMD) formulation tries to overcome the multiple-minima 

problem by exchanging the temperature of non-interacting replicas of the system running 
at several temperatures. 

Potentials in MD simulations 

A molecular dynamics simulation requires the definition of a potential function, or a 
description of the terms by which the particles in the simulation will interact. In chemistry 
and biology this is usually referred to as a force field. Potentials may be defined at many 
levels of physical accuracy; those most commonly used in chemistry are based on molecular 
mechanics and embody a classical treatment of particle-particle interactions that can 
reproduce structural and conformational changes but usually cannot reproduce chemical 
reactions. 

The reduction from a fully quantum description to a classical potential entails two main 
approximations. The first one is the Born-Oppenheimer approximation, which states that 
the dynamics of electrons is so fast that they can be considered to react instantaneously to 
the motion of their nuclei. As a consequence, they may be treated separately. The second 
one treats the nuclei, which are much heavier than electrons, as point particles that follow 
classical Newtonian dynamics. In classical molecular dynamics the effect of the electrons is 
approximated as a single potential energy surface, usually representing the ground state. 

When finer levels of detail are required, potentials based on quantum mechanics are used; 
some techniques attempt to create hybrid classical/quantum potentials where the bulk of 
the system is treated classically but a small region is treated as a quantum system, usually 
undergoing a chemical transformation. 

Empirical potentials 

Empirical potentials used in chemistry are frequently called force fields, while those used in 
materials physics are called just empirical or analytical potentials. 

Most force fields in chemistry are empirical and consist of a summation of bonded forces 
associated with chemical bonds, bond angles, and bond dihedrals, and non-bonded forces 
associated with van der Waals forces and electrostatic charge. Empirical potentials 
represent quantum-mechanical effects in a limited way through ad-hoc functional 
approximations. These potentials contain free parameters such as atomic charge, van der 
Waals parameters reflecting estimates of atomic radius, and equilibrium bond length, 
angle, and dihedral; these are obtained by fitting against detailed electronic calculations 
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(quantum chemical simulations) or experimental physical properties such as elastic 
constants, lattice parameters and spectroscopic measurements. 

Because of the non-local nature of non-bonded interactions, they involve at least weak 
interactions between all particles in the system. Its calculation is normally the bottleneck in 
the speed of MD simulations. To lower the computational cost, force fields employ 
numerical approximations such as shifted cutoff radii, reaction field algorithms, particle 
mesh Ewald summation, or the newer Particle-Particle Particle Mesh (P3M). 

Chemistry force fields commonly employ preset bonding arrangements (an exception being 
ab-initio dynamics), and thus are unable to model the process of chemical bond breaking 
and reactions explicitly. On the other hand, many of the potentials used in physics, such as 
those based on the bond order formalism can describe several different coordinations of a 
system and bond breaking. Examples of such potentials include the Brenner potential for 



hydrocarbons and its further developments for the C-Si-H and C-O-H systems. The ReaxFF 
potential can be co 
chemistry force fields. 



potential can be considered a fully reactive hybrid between bond order potentials and 



Pair potentials vs. many-body potentials 

The potential functions representing the non-bonded energy are formulated as a sum over 
interactions between the particles of the system. The simplest choice, employed in many 
popular force fields, is the "pair potential", in which the total potential energy can be 
calculated from the sum of energy contributions between pairs of atoms. An example of 
such a pair potential is the non-bonded Lennard -Jones potential (also known as the 6-12 
potential), used for calculating van der Waals forces. 



U(r) = ie 



r-(*r 



Another example is the Born (ionic) model of the ionic lattice. The first term in the next 
equation is Coulomb's law for a pair of ions, the second term is the short-range repulsion 
explained by Pauli's exclusion principle and the final term is the dispersion interaction 
term. Usually, a simulation only includes the dipolar term, although sometimes the 
quadrupolar term is included as well. 

%(^) = E 5rr + E A ex P =-p> + E <W + ■ ■ ■ 

In many-body potentials, the potential energy includes the effects of three or more particles 
interacting with each other. In simulations with pairwise potentials, global interactions in 
the system also exist, but they occur only through pairwise terms. In many-body potentials, 
the potential energy cannot be found by a sum over pairs of atoms, as these interactions are 
calculated explicitly as a combination of higher-order terms. In the statistical view, the 

dependency between the variables cannot in general be expressed using only pairwise 

ri2i 
products of the degrees of freedom. For example, the Tersoff potential , which was 

originally used to simulate carbon, silicon and germanium and has since been used for a 

wide range of other materials, involves a sum over groups of three atoms, with the angles 

between the atoms being an important factor in the potential. Other examples are the 

ri3i 
embedded-atom method (EAM) and the Tight-Binding Second Moment Approximation 

(TBSMA) potentials , where the electron density of states in the region of an atom is 

calculated from a sum of contributions from surrounding atoms, and the potential energy 

contribution is then a function of this sum. 
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Semi-empirical potentials 

Semi-empirical potentials make use of the matrix representation from quantum mechanics. 
However, the values of the matrix elements are found through empirical formulae that 
estimate the degree of overlap of specific atomic orbitals. The matrix is then diagonalized to 
determine the occupancy of the different atomic orbitals, and empirical formulae are used 
once again to determine the energy contributions of the orbitals. 

There are a wide variety of semi-empirical potentials, known as tight-binding potentials, 
which vary according to the atoms being modeled. 

Polarizable potentials 

Most classical force fields implicitly include the effect of polarizability, e.g. by scaling up 
the partial charges obtained from quantum chemical calculations. These partial charges are 
stationary with respect to the mass of the atom. But molecular dynamics simulations can 
explicitly model polarizability with the introduction of induced dipoles through different 
methods, such as Drude particles or fluctuating charges. This allows for a dynamic 
redistribution of charge between atoms which responds to the local chemical environment. 

For many years, polarizable MD simulations have been touted as the next generation. For 
homogenous liquids such as water, increased accuracy has been achieved through the 
inclusion of polarizability. Some promising results have also been achieved for 

proteins. ' However, it is still uncertain how to best approximate polarizability in a 
simulation. 

Ab-initio methods 

In classical molecular dynamics, a single potential energy surface (usually the ground state) 
is represented in the force field. This is a consequence of the Born-Oppenheimer 
approximation. If excited states, chemical reactions or a more accurate representation is 
needed, electronic behavior can be obtained from first principles by using a quantum 
mechanical method, such as Density Functional Theory. This is known as Ab Initio 
Molecular Dynamics (AIMD). Due to the cost of treating the electronic degrees of freedom, 
the computational cost of this simulations is much higher than classical molecular 
dynamics. This implies that AIMD is limited to smaller systems and shorter periods of time. 

Ab-initio -» quantum-mechanical methods may be used to calculate the potential energy of a 
system on the fly, as needed for conformations in a trajectory. This calculation is usually 
made in the close neighborhood of the reaction coordinate. Although various 
approximations may be used, these are based on theoretical considerations, not on 
empirical fitting. Ab-initio calculations produce a vast amount of information that is not 
available from empirical methods, such as density of electronic states or other electronic 
properties. A significant advantage of using ab-initio methods is the ability to study 
reactions that involve breaking or formation of covalent bonds, which correspond to 
multiple electronic states. 

A popular software for ab-initio molecular dynamics is the Car-Parrinello Molecular 
Dynamics (CPMD) package based on the density functional theory. 
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Hybrid QM/MM 

QM (quantum-mechanical) methods are very powerful. However, they are computationally 
expensive, while the MM (classical or molecular mechanics) methods are fast but suffer 
from several limitations (require extensive parameterization; enerqy estimates obtained are 
not very accurate; cannot be used to simulate reactions where covalent bonds are 
broken/formed; and are limited in their abilities for providinq accurate details reqardinq the 
chemical environment). A new class of method has emerqed that combines the qood points 
of QM (accuracy) and MM (speed) calculations. These methods are known as mixed or 
hybrid quantum-mechanical and molecular mechanics methods (hybrid QM/MM). The 
methodoloqy for such techniques was introduced by Warshel and coworkers. In the recent 
years have been pioneered by several qroups includinq: Arieh Warshel (University of 
Southern California), Weitao Yanq (Duke University), Sharon Hammes-Schiffer (The 
Pennsylvania State University), Donald Truhlar and Jiali Gao (University of Minnesota) and 
Kenneth Merz (University of Florida). 

The most important advantaqe of hybrid QM/MM methods is the speed. The cost of doinq 
classical molecular dynamics (MM) in the most straiqhtforward case scales 0(n ), where N 
is the number of atoms in the system. This is mainly due to electrostatic interactions term 
(every particle interacts with every other particle). However, use of cutoff radius, periodic 
pair-list updates and more recently the variations of the particle-mesh Ewald's (PME) 
method has reduced this between O(N) to 0(n ). In other words, if a system with twice 
many atoms is simulated then it would take between twice to four times as much computinq 
power. On the other hand the simplest ab-initio calculations typically scale 0(n ) or worse 

2 7 

(Restricted Hartree-Fock calculations have been suqqested to scale ~0(n ' )). To overcome 
the limitation, a small part of the system is treated quantum-mechanically (typically 
active-site of an enzyme) and the remaininq system is treated classically. 

In more sophisticated implementations, QM/MM methods exist to treat both liqht nuclei 
susceptible to quantum effects (such as hydroqens) and electronic states. This allows 
qeneration of hydroqen wave-functions (similar to electronic wave-functions). This 
methodoloqy has been useful in investiqatinq phenomenon such as hydroqen tunnelinq. One 
example where QM/MM methods have provided new discoveries is the calculation of 
hydride transfer in the enzyme liver alcohol dehydroqenase. In this case, tunnelinq is 
important for the hydroqen, as it determines the reaction rate. 

Coarse-graining and reduced representations 

At the other end of the detail scale are coarse-qrained and lattice models. Instead of 
explicitly representinq every atom of the system, one uses "pseudo-atoms" to represent 
qroups of atoms. MD simulations on very larqe systems may require such larqe computer 
resources that they cannot easily be studied by traditional all-atom methods. Similarly, 
simulations of processes on lonq timescales (beyond about 1 microsecond) are prohibitively 
expensive, because they require so many timesteps. In these cases, one can sometimes 
tackle the problem by usinq reduced representations, which are also called coarse-qrained 
models. 

Examples for coarse qraininq (CG) methods are discontinuous molecular dynamics 
(CG-DMD) and Go-models . Coarse-qraininq is done sometimes takinq larqer 

pseudo-atoms. Such united atom approximations have been used in MD simulations of 
bioloqical membranes. The aliphatic tails of lipids are represented by a few pseudo-atoms 
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by gathering 2-4 methylene groups into each pseudo-atom. 

The parameterization of these very coarse-grained models must be done empirically, by 
matching the behavior of the model to appropriate experimental data or all-atom 
simulations. Ideally, these parameters should account for both enthalpic and entropic 
contributions to free energy in an implicit way. When coarse-graining is done at higher 
levels, the accuracy of the dynamic description may be less reliable. But very 
coarse-grained models have been used successfully to examine a wide range of questions in 
structural biology. 

Examples of applications of coarse-graining in biophysics: 

• protein folding studies are often carried out using a single (or a few) pseudo-atoms per 
amino acid; 

• DNA supercoiling has been investigated using 1-3 pseudo-atoms per basepair, and at 
even lower resolution; 

• Packaging of double-helical DNA into bacteriophage has been investigated with models 
where one pseudo-atom represents one turn (about 10 basepairs) of the double helix; 

• RNA structure in the ribosome and other large systems has been modeled with one 
pseudo-atom per nucleotide. 

The simplest form of coarse-graining is the "united atom" (sometimes called "extended 
atom") and was used in most early MD simulations of proteins, lipids and nucleic acids. For 
example, instead of treating all four atoms of a CH methyl group explicitly (or all three 
atoms of CH methylene group), one represents the whole group with a single pseudo-atom. 
This pseudo-atom must, of course, be properly parameterized so that its van der Waals 
interactions with other groups have the proper distance-dependence. Similar 
considerations apply to the bonds, angles, and torsions in which the pseudo-atom 
participates. In this kind of united atom representation, one typically eliminates all explicit 
hydrogen atoms except those that have the capability to participate in hydrogen bonds 
("polar hydrogens"). An example of this is the Charmm 19 force-field. 

The polar hydrogens are usually retained in the model, because proper treatment of 
hydrogen bonds requires a reasonably accurate description of the directionality and the 
electrostatic interactions between the donor and acceptor groups. A hydroxyl group, for 
example, can be both a hydrogen bond donor and a hydrogen bond acceptor, and it would 
be impossible to treat this with a single OH pseudo-atom. Note that about half the atoms in 
a protein or nucleic acid are nonpolar hydrogens, so the use of united atoms can provide a 
substantial savings in computer time. 

Examples of applications 

Molecular dynamics is used in many fields of science. 

• First macromolecular MD simulation published (1977, Size: 500 atoms, Simulation Time: 
9.2 ps=0.0092 ns, Program: CHARMM precursor) Protein: Bovine Pancreatic Trypsine 
Inhibitor. This is one of the best studied proteins in terms of folding and kinetics. Its 
simulation published in Nature magazine paved the way for understanding protein 
motion as essential in function and not just accessory. 

• MD is the standard method to treat collision cascades in the heat spike regime, i.e. the 

["221 T231 

effects that energetic neutron and ion irradiation have on solids an solid surfaces. 
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The following two biophysical examples are not run-of-the-mill MD simulations. They 
illustrate almost heroic efforts to produce simulations of a system of very large size (a 
complete virus) and very long simulation times (500 microseconds): 

• MD simulation of the complete satellite tobacco mosaic virus (STMV) (2006, Size: 1 
million atoms, Simulation time: 50 ns, program: NAMD) This virus is a small, icosahedral 
plant virus which worsens the symptoms of infection by Tobacco Mosaic Virus (TMV). 
Molecular dynamics simulations were used to probe the mechanisms of viral assembly. 
The entire STMV particle consists of 60 identical copies of a single protein that make up 
the viral capsid (coating), and a 1063 nucleotide single stranded RNA genome. One key 
finding is that the capsid is very unstable when there is no RNA inside. The simulation 
would take a single 2006 desktop computer around 35 years to complete. It was thus 
done in many processors in parallel with continuous communication between them. ] 

• Folding Simulations of the Villin Headpiece in All-Atom Detail (2006, Size: 20,000 atoms; 
Simulation time: 500 us = 500,000 ns, Program: folding@home) This simulation was run 
in 200,000 CPU's of participating personal computers around the world. These 
computers had the folding@home program installed, a large-scale distributed computing 
effort coordinated by Vijay Pande at Stanford University. The kinetic properties of the 
Villin Headpiece protein were probed by using many independent, short trajectories run 
by CPU's without continuous real-time communication. One technique employed was the 
Pfold value analysis, which measures the probability of folding before unfolding of a 
specific starting conformation. Pfold gives information about transition state structures 

and an ordering of conformations along the folding pathway. Each trajectory in a Pfold 

\i si 
calculation can be relatively short, but many independent trajectories are needed. 

Molecular dynamics algorithms 
Integrators 

• Verlet-Stoermer integration 

• Runge-Kutta integration 

• Beeman's algorithm 

• Gear predictor - corrector 

• Constraint algorithms (for constrained systems) 

• Symplectic integrator 

Short-range interaction algorithms 

• Cell lists 

• Verlet list 

• Bonded interactions 

Long-range interaction algorithms 

• Ewald summation 

• Particle Mesh Ewald (PME) 

• Particle-Particle Particle Mesh P3M 

• Reaction Field Method 
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Parallelization strategies 

• Domain decomposition method (Distribution of system data for parallel computing) 

• Molecular Dynamics - Parallel Algorithms 

Major software for MD simulations 

Abalone (classical, implicit water) 

ABINIT (DFT) 

ACEMD [27] (running on NVIDIA GPUs: heavily optimized with CUDA) 

T281 

ADUN (classical, P2P database for simulations) 
AMBER (classical) 

T291 

Ascalaph (classical, GPU accelerated) 

CASTEP (DFT) 

CPMD (DFT) 

CP2K [30] (DFT) 

CHARMM (classical, the pioneer in MD simulation, extensive analysis tools) 

COSMOS (classical and hybrid QM/MM, quantum-mechanical atomic charges with 

BPT) 

T321 

Desmond (classical, parallelization with up to thousands of CPU's) 

DLPOLY [33] (classical) 

ESPResSo (classical, coarse-grained, parallel, extensible) 

Fireball [34] (tight-binding DFT) 

GROMACS (classical) 

GROMOS (classical) 

GULP (classical) 

Hippo (classical) 

LAMMPS (classical, large-scale with spatial-decomposition of simulation domain for 

parallelism) 

MDynaMix (classical, parallel) 

MOLDY [36] (classical, parallel) latest release [37] 

Materials Studio [38] (Forcite MD using COMPASS, Dreiding, Universal, cvff and pcff 

forcefields in serial or parallel, QMERA (QM+MD), ONESTEP (DFT), etc.) 

MOSCITO (classical) 

NAMD (classical, parallelization with up to thousands of CPU's) 

T391 

NEWTON-X (ab initio, surface-hopping dynamics) 

ProtoMol [ ] (classical, extensible, includes multigrid electrostatics) 

PWscf (DFT) 

S/PHI/nX [41] (DFT) 

SIESTA (DFT) 

VASP (DFT) 

TINKER (classical) 

YASARA [42] (classical) 

ORAC [43] (classical) 

XMD (classical) 
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Related software 

• VMD - MD simulation trajectories can be visualized and analyzed. 

• PyMol - Molecular Visualization software written in python 

• Packmol Package for building starting configurations for MD in an automated fashion 

• Sirius - Molecular modeling, analysis and visualization of MD trajectories 

• esra - Lightweight molecular modeling and analysis library 
(Java/Jython/Mathematica). 

• Molecular Workbench - Interactive molecular dynamics simulations on your desktop 

• BOSS - MC in OPLS 

Specialized hardware for MD simulations 

• Anton - A specialized, massively parallel supercomputer designed to execute MD 
simulations. 

• MDGRAPE - A special purpose system built for molecular dynamics simulations, 
especially protein structure prediction. 

See also 

-» Molecular graphics 

Molecular modeling 

Computational chemistry 

Energy drift 

Force field in Chemistry 

Force field implementation 

-» Monte Carlo method 

Molecular Design software 

Molecular mechanics 

Molecular modeling on GPU 

Protein dynamics 

Implicit solvation 

Car-Parrinello method 

Symplectic numerical integration 

Software for molecular mechanics modeling 

Dynamical systems 

Theoretical chemistry 

-» Statistical mechanics 

-» Quantum chemistry 

Discrete element method 

List of nucleic acid simulation software 
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Molecular modelling 



Molecular modelling is a collective term that refers to 
theoretical methods and computational techniques to 
model or mimic the behaviour of molecules. The 
techniques are used in the fields of computational 
chemistry, computational biology and materials science 
for studying molecular systems ranging from small 
chemical systems to large biological molecules and 
material assemblies. The simplest calculations can be 
performed by hand, but inevitably computers are 
required to perform molecular modelling of any 
reasonably sized system. The common feature of 
molecular modelling techniques is the atomistic level 
description of the molecular systems; the lowest level of 
information is individual atoms (or a small group of 
atoms). This is in contrast to -» quantum chemistry (also 
known as electronic structure calculations) where 
electrons are considered explicitly. The benefit of 
molecular modelling is that it reduces the complexity of 
the system, allowing many more particles (atoms) to be 
considered during simulations. 

Molecular mechanics is one aspect of molecular 

modelling, as it is refers to the use of classical 

mechanics/Newtonian mechanics to describe the 

physical basis behind the models. Molecular models 

typically describe atoms (nucleus and electrons 

collectively) as point charges with an associated mass. 

The interactions between neighbouring atoms are 

described by spring-like interactions (representing 

chemical bonds) and van der Waals forces. The 

Lennard-Jones potential is commonly used to describe 

van der Waals forces. The electrostatic interactions are 

computed based on Coulomb's law. Atoms are assigned 

coordinates in Cartesian space or in internal 

coordinates, and can also be assigned velocities in 

dynamical simulations. The atomic velocities are related 

to the temperature of the system, a macroscopic 

quantity. The collective mathematical expression is 

known as a potential function and is related to the 

system internal energy (U), a thermodynamic quantity equal to the sum of potential and 

kinetic energies. Methods which minimize the potential energy are known as energy 

minimization techniques (e.g., steepest descent and conjugate gradient), while methods 

that model the behaviour of the system with propagation of time are known as -» molecular 

dynamics. 




The backbone dihedral angles are 

included in the molecular model of 

a protein. 




Modelling of ionic liquid 
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■*-' ^bonds ~r *-j angle ~T ^dihedral ~T ^non— bonded 
^non— bonded ^electrostatic ~v ^vanderWaals 

This function, referred to as a potential function, computes the molecular potential energy 
as a sum of energy terms that describe the deviation of bond lengths, bond angles and 
torsion angles away from equilibrium values, plus terms for non-bonded pairs of atoms 
describing van der Waals and electrostatic interactions. The set of parameters consisting of 
equilibrium bond lengths, bond angles, partial charge values, force constants and van der 
Waals parameters are collectively known as a force field. Different implementations of 
molecular mechanics use slightly different mathematical expressions, and therefore, 
different constants for the potential function. The common force fields in use today have 
been developed by using high level quantum calculations and/or fitting to experimental 
data. The technique known as energy minimization is used to find positions of zero gradient 
for all atoms, in other words, a local energy minimum. Lower energy states are more stable 
and are commonly investigated because of their role in chemical and biological processes. A 
-» molecular dynamics simulation, on the other hand, computes the behaviour of a system 
as a function of time. It involves solving Newton's laws of motion, principally the second 
law, F = ma. Integration of Newton's laws of motion, using different integration algorithms, 
leads to atomic trajectories in space and time. The force on an atom is defined as the 
negative gradient of the potential energy function. The energy minimization technique is 
useful for obtaining a static picture for comparing between states of similar systems, while 
molecular dynamics provides information about the dynamic processes with the intrinsic 
inclusion of temperature effects. 

Molecules can be modelled either in vacuum or in the presence of a solvent such as water. 
Simulations of systems in vacuum are referred to as gas-phase simulations, while those that 
include the presence of solvent molecules are referred to as explicit solvent simulations. In 
another type of simulation, the effect of solvent is estimated using an empirical 
mathematical expression; these are known as implicit solvation simulations. 

Molecular modelling methods are now routinely used to investigate the structure, dynamics 
and thermodynamics of inorganic, biological, and polymeric systems. The types of biological 
activity that have been investigated using molecular modelling include protein folding, 
enzyme catalysis, protein stability, conformational changes associated with biomolecular 
function, and molecular recognition of proteins, DNA, and membrane complexes. 

Popular software for molecular modelling 

Abalone 

AMBER 

ADF 

Ascalaph Designer 

BALLView 

Biskit 

BOSS 

Cerius2 

Chimera 

CHARMM 

Coot (program) 1 for X-ray crystallography of biological molecules 



COSMOS (software) [3] 
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CP2K 

CPMD 

Firefly 

GAMESS (UK) 

GAMESS (US) 

GAUSSIAN 

Ghemical 

GROMACS 

GROMOS 

Insightll 

LAMMPS 

MacroModel 

MarvinSpace 

Materials Studio 

MDynaMix 

MMTK 

MOE (software) [5] 

Molecular Docking Server 

Molsoft ICM [6] 

MOPAC 

NAMD 

NOCH 

Oscail X 

PyMOL 

Q-Chem 

Sirius 

SPARTAN (software) [7] 

STR3DI32 [8] 

Sybyl (software) [9] 

MCCCS Towhee [10] 

TURBOMOLE 

ReaxFF 

VMD 

WHAT IF [11] 

xeo [12] 

YASARA [13] 

Zodiac (software) [14] 
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See also 

Cheminformatics 

Computational chemistry 

Density functional theory programs. 

Force field in Chemistry 

Force field implementation 

List of nucleic acid simulation software 

List of protein structure prediction software 

Molecular Design software 

-» Molecular dynamics 

-» Molecular graphics 

Molecular mechanics 

Molecular model 

Molecular modelling on GPU 

Molecule editor 

-» Monte Carlo method 

Quantum chemistry computer programs 

Semi-empirical quantum chemistry method 

Software for molecular mechanics modelling 

Structural Bioinformatics 

External links 

• Center for Molecular Modeling at the National Institutes of Health (NIH) [15] (U.S. 
Government Agency) 

• Molecular Simulation [16] , details for the Molecular Simulation journal ISSN: 0892-7022 
(print), 1029-0435 (online) 

• The eCheminfo Network and Community of Practice in Informatics and Modeling 
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Homepage 

[I] Agile Molecule (http://www.agilemolecule.com/index.html) 

[2] York Structural Biology Laboratory (http://www.ysbl.york.ac.uk/~emsley/coot/) 

[3] COSMOS (http://www.cosmos-software.de/ce_intro.html) - Computer Simulation of Molecular Structures 

[4] ChemAxon (http://www.chemaxon.com/product/mspace.html) 

[5] MOE - Molecular Operating Environment, Chemical Computing Group (http://www.chemcomp.com/) 

[6] Molsoft (http://www.molsoft.com/) 

[7] Wave function. Inc. (http://www.wavefun.com/) 

[8] Exorga, Inc. (http://www.exorga.com/) 

[9] Tipos (http://www.tripos.com/sybyl/) 

[10] MCCCS Towhee (http://towhee.sourceforge.net/) - Monte Carlo for Complex Chemical Systems 

[II] CMBI (http://swift.cmbi.ru.nl/whatif/) 
[12] xeo (http://sourceforge.net/projects/xeo) 
[13] YASARA (http://www.yasara.org/) 

[14] ZedeN (http://www.zeden.org) 

[15] http://cmm.info.nih.gov/modeling/ 

[16] http://www.tandf.co.uk/journals/titles/08927022.asp 

[17] http://www.echeminfo.com/ 

[18] http://www.amrita.edu/cen/ccmm 
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Monte Carlo methods are a class of computational algorithms that rely on repeated 
random sampling to compute their results. Monte Carlo methods are often used when 
simulating physical and mathematical systems. Because of their reliance on repeated 
computation and random or pseudo-random numbers, Monte Carlo methods are most 
suited to calculation by a computer. Monte Carlo methods tend to be used when it is 
unfeasible or impossible to compute an exact result with a deterministic algorithm.' ' 

Monte Carlo simulation methods are especially useful in studying systems with a large 
number of coupled degrees of freedom, such as fluids, disordered materials, strongly 
coupled solids, and cellular structures (see cellular Potts model). More broadly, Monte 
Carlo methods are useful for modeling phenomena with significant uncertainty in inputs, 
such as the calculation of risk in business. These methods are also widely used in 
mathematics: a classic use is for the evaluation of definite integrals, particularly 
multidimensional integrals with complicated boundary conditions. It is a widely successful 
method in risk analysis when compared to alternative methods or human intuition. When 
Monte Carlo simulations have been applied in space exploration and oil exploration, actual 
observations of failures, cost overruns and schedule overruns are routinely better predicted 
by the simulations than by human intuition or alternative "soft" methods. 

The term "Monte Carlo method" was coined in the 1940s by physicists working on nuclear 
weapon projects in the Los Alamos National Laboratory. 
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Overview 

There is no single Monte Carlo method; instead, the 
term describes a large and widely-used class of 
approaches. However, these approaches tend to follow 
a particular pattern: 

1. Define a domain of possible inputs. 

2. Generate inputs randomly from the domain. 

3. Perform a deterministic computation using the 
inputs. 

4. Aggregate the results of the individual computations 
into the final result. 

For example, the value of n can be approximated using 
a Monte Carlo method: 

1. Draw a square on the ground, then inscribe a circle 
within it. From plane geometry, the ratio of the area 
of an inscribed circle to that of the surrounding 
square is n/4. 

2. Uniformly scatter some objects of uniform size 
throughout the square. For example, grains of rice or 
sand. 

3. Since the two areas are in the ratio n/4, the objects 
should fall in the areas in approximately the same 
ratio. Thus, counting the number of objects in the 
circle and dividing by the total number of objects in 
the square will yield an approximation for n/4. 
Multiplying the result by 4 will then yield an 
approximation for n itself. 
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The Monte Carlo method can be 

illustrated as a game of battleship. 

First a player makes some random 

shots. Next the player applies 

algorithms (i.e. a battleship is four dots 

in the vertical or horizontal direction). 

Finally based on the outcome of the 

random sampling and the algorithm 

the player can determine the likely 

locations of the other player's ships. 



Notice how the n approximation follows the general 

pattern of Monte Carlo algorithms. First, we define a domain of inputs: in this case, it's the 
square which circumscribes our circle. Next, we generate inputs randomly (scatter 
individual grains within the square), then perform a computation on each input (test 
whether it falls within the circle). At the end, we aggregate the results into our final result, 
the approximation of n. Note, also, two other common properties of Monte Carlo methods: 
the computation's reliance on good random numbers, and its slow convergence to a better 
approximation as more data points are sampled. If grains are purposefully dropped into 
only, for example, the center of the circle, they will not be uniformly distributed, and so our 
approximation will be poor. An approximation will also be poor if only a few grains are 
randomly dropped into the whole square. Thus, the approximation of n will become more 
accurate both as the grains are dropped more uniformly and as more are dropped. 
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History 

The name "Monte Carlo" was popularized by physics researchers Stanislaw Ulam, Enrico 
Fermi, John von Neumann, and Nicholas Metropolis, among others; the name is a reference 
to the Monte Carlo Casino in Monaco where Ulam's uncle would borrow money to 
gamble. The use of randomness and the repetitive nature of the process are analogous to 
the activities conducted at a casino. 

Random methods of computation and experimentation (generally considered forms of 
stochastic simulation) can be arguably traced back to the earliest pioneers of probability 
theory (see, e.g., Buffon's needle, and the work on small samples by William Sealy Gosset), 
but are more specifically traced to the pre-electronic computing era. The general difference 
usually described about a Monte Carlo form of simulation is that it systematically "inverts" 
the typical mode of simulation, treating deterministic problems by first finding a 
probabilistic analog (see Simulated annealing). Previous methods of simulation and 
statistical sampling generally did the opposite: using simulation to test a previously 
understood deterministic problem. Though examples of an "inverted" approach do exist 
historically, they were not considered a general method until the popularity of the Monte 
Carlo method spread. 

Perhaps the most famous early use was by Enrico Fermi in 1930, when he used a random 
method to calculate the properties of the newly-discovered neutron. Monte Carlo methods 
were central to the simulations required for the Manhattan Project, though were severely 
limited by the computational tools at the time. Therefore, it was only after electronic 
computers were first built (from 1945 on) that Monte Carlo methods began to be studied in 
depth. In the 1950s they were used at Los Alamos for early work relating to the 
development of the hydrogen bomb, and became popularized in the fields of physics, 
physical chemistry, and operations research. The Rand Corporation and the U.S. Air Force 
were two of the major organizations responsible for funding and disseminating information 
on Monte Carlo methods during this time, and they began to find a wide application in 
many different fields. 

Uses of Monte Carlo methods require large amounts of random numbers, and it was their 
use that spurred the development of pseudorandom number generators, which were far 
quicker to use than the tables of random numbers which had been previously used for 
statistical sampling. 

Applications 

As mentioned, Monte Carlo simulation methods are especially useful for modeling 
phenomena with significant uncertainty in inputs and in studying systems with a large 
number of coupled degrees of freedom. Specific areas of application include: 

Physical sciences 

Monte Carlo methods are very important in computational physics, physical chemistry, and 
related applied fields, and have diverse applications from complicated quantum 
chromodynamics calculations to designing heat shields and aerodynamic forms. The Monte 
Carlo method is widely used in -» statistical physics, in particular, -» Monte Carlo molecular 
modeling as an alternative for computational -» molecular dynamics; see Monte Carlo 
method in statistical physics. In experimental particle physics, these methods are used for 
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designing detectors, understanding their behavior and comparing experimental data to 
theory. 

Design and visuals 

Monte Carlo methods have also proven efficient in solving coupled integral differential 
equations of radiation fields and energy transport, and thus these methods have been used 
in global illumination computations which produce photorealistic images of virtual 3D 
models, with applications in video games, architecture, design, computer generated films, 
special effects in cinema. 

Finance and business 

Monte Carlo methods in finance are often used to calculate the value of companies, to 
evaluate investments in projects at corporate level or to evaluate financial derivatives. The 
Monte Carlo method is intended for financial analysts who want to construct stochastic or 
probabilistic financial models as opposed to the traditional static and deterministic models. 
For its use in the insurance industry, see stochastic modelling. 

Telecommunications 

When planning a wireless network, design must be proved to work for a wide variety of 
scenarios that depend mainly on the number of users, their locations and the services they 
want to use. Monte Carlo methods are typically used to generate these users and their 
states. The network performance is then evaluated and, if results are not satisfactory, the 
network design goes through an optimization process. 

Games 

Monte Carlo methods have recently been applied in game playing related artificial 
intelligence theory. Most notably the game of Go has seen remarkably successful Monte 
Carlo algorithm based computer players. One of the main problems that this approach has 
in game playing is that it sometimes misses an isolated, very good move. These approaches 
are often strong strategically but weak tactically, as tactical decisions tend to rely on a 
small number of crucial moves which are easily missed by the randomly searching Monte 
Carlo algorithm. 

Monte Carlo simulation versus "what if" scenarios 

The opposite of Monte Carlo simulation might be considered deterministic modeling using 
single-point estimates. Each uncertain variable within a model is assigned a "best guess" 
estimate. Various combinations of each input variable are manually chosen (such as best 
case, worst case, and most likely case), and the results recorded for each so-called "what if" 

[5] 

scenario. 

By contrast, Monte Carlo simulation considers random sampling of probability distribution 
functions as model inputs to produce hundreds or thousands of possible outcomes instead 
of a few discrete scenarios. The results provide probabilities of different outcomes 
occurring. For example, a comparison of a spreadsheet cost construction model run 
using traditional "what if" scenarios, and then run again with Monte Carlo simulation and 
Triangular probability distributions shows that the Monte Carlo analysis has a narrower 
range than the "what if" analysis. This is because the "what if" analysis gives equal weight 
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to all scenarios. ' 

For an application, see quantifying uncertainty under corporate finance. 

Use in mathematics 

In general, Monte Carlo methods are used in mathematics to solve various problems by 
generating suitable random numbers and observing that fraction of the numbers obeying 
some property or properties. The method is useful for obtaining numerical solutions to 
problems which are too complicated to solve analytically. The most common application of 
the Monte Carlo method is Monte Carlo integration. 

Integration 

Deterministic methods of numerical integration operate by taking a number of evenly 
spaced samples from a function. In general, this works very well for functions of one 
variable. However, for functions of vectors, deterministic quadrature methods can be very 
inefficient. To numerically integrate a function of a two-dimensional vector, equally spaced 
grid points over a two-dimensional surface are required. For instance a 10x10 grid requires 
100 points. If the vector has 100 dimensions, the same spacing on the grid would require 
10 points— far too many to be computed. 100 dimensions is by no means unreasonable, 
since in many physical problems, a "dimension" is equivalent to a degree of freedom. (See 
Curse of dimensionality.) 

Monte Carlo methods provide a way out of this exponential time-increase. As long as the 
function in question is reasonably well-behaved, it can be estimated by randomly selecting 
points in 100-dimensional space, and taking some kind of average of the function values at 
these points. By the law of large numbers, this method will display 1/viV 
convergence— i.e. quadrupling the number of sampled points will halve the error, 
regardless of the number of dimensions. 

A refinement of this method is to somehow make the points random, but more likely to 
come from regions of high contribution to the integral than from regions of low 
contribution. In other words, the points should be drawn from a distribution similar in form 
to the integrand. Understandably, doing this precisely is just as difficult as solving the 
integral in the first place, but there are approximate methods available: from simply making 
up an integrable function thought to be similar, to one of the adaptive routines discussed in 
the topics listed below. 

A similar approach involves using low-discrepancy sequences instead— the -» quasi-Monte 
Carlo method. Quasi-Monte Carlo methods can often be more efficient at numerical 
integration because the sequence "fills" the area better in a sense and samples more of the 
most important points that can make the simulation converge to the desired solution more 
quickly. 
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Integration methods 

• Direct sampling methods 

• Importance sampling 

• Stratified sampling 

• Recursive stratified sampling 

• VEGAS algorithm 

• Random walk Monte Carlo including Markov chains 

• Metropolis-Hastings algorithm 

• Gibbs sampling 

Optimization 

Another powerful and very popular application for random numbers in numerical simulation 
is in numerical optimization. These problems use functions of some often large-dimensional 
vector that are to be minimized (or maximized). Many problems can be phrased in this way: 
for example a computer chess program could be seen as trying to find the optimal set of, 
say, 10 moves which produces the best evaluation function at the end. The traveling 
salesman problem is another optimization problem. There are also applications to 
engineering design, such as multidisciplinary design optimization. 

Most Monte Carlo optimization methods are based on random walks. Essentially, the 
program will move around a marker in multi-dimensional space, tending to move in 
directions which lead to a lower function, but sometimes moving against the gradient. 

Optimization methods 

• Evolution strategy 

• Genetic algorithms 

• Parallel tempering 

• Simulated annealing 

• Stochastic optimization 

• Stochastic tunneling 

Inverse problems 

Probabilistic formulation of inverse problems leads to the definition of a probability 
distribution in the model space. This probability distribution combines a priori information 
with new information obtained by measuring some observable parameters (data). As, in the 
general case, the theory linking data with model parameters is nonlinear, the a posteriori 
probability in the model space may not be easy to describe (it may be multimodal, some 
moments may not be defined, etc.). 

When analyzing an inverse problem, obtaining a maximum likelihood model is usually not 
sufficient, as we normally also wish to have information on the resolution power of the data. 
In the general case we may have a large number of model parameters, and an inspection of 
the marginal probability densities of interest may be impractical, or even useless. But it is 
possible to pseudorandomly generate a large collection of models according to the posterior 
probability distribution and to analyze and display the models in such a way that 
information on the relative likelihoods of model properties is conveyed to the spectator. 
This can be accomplished by means of an efficient Monte Carlo method, even in cases 
where no explicit formula for the a priori distribution is available. 
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The best-known importance sampling method, the Metropolis algorithm, can be 
generalized, and this gives a method that allows analysis of (possibly highly nonlinear) 
inverse problems with complex a priori information and data with an arbitrary noise 
distribution. For details, see Mosegaard and Tarantola (1995), or Tarantola (2005). 

Computational mathematics 

Monte Carlo methods are useful in many areas of computational mathematics, where a 
lucky choice can find the correct result. A classic example is Rabin's algorithm for primality 
testing: for any n which is not prime, a random x has at least a 75% chance of proving that 
n is not prime. Hence, if n is not prime, but x says that it might be, we have observed at 
most a l-in-4 event. If 10 different random x say that "n is probably prime" when it is not, 
we have observed a one-in-a-million event. In general a Monte Carlo algorithm of this kind 
produces one correct answer with a guarantee n is composite, and x proves it so, but 
another one without, but with a guarantee of not getting this answer when it is wrong too 
often — in this case at most 25% of the time. See also Las Vegas algorithm for a related, 
but different, idea. 

Monte Carlo and random numbers 

Interestingly, Monte Carlo simulation methods do not always require truly random numbers 
to be useful — while for some applications, such as primality testing, unpredictability is 
vital (see Davenport (1995)). Many of the most useful techniques use deterministic, 
pseudo-random sequences, making it easy to test and re-run simulations. The only quality 
usually necessary to make good simulations is for the pseudo-random sequence to appear 
"random enough" in a certain sense. 

What this means depends on the application, but typically they should pass a series of 
statistical tests. Testing that the numbers are uniformly distributed or follow another 
desired distribution when a large enough number of elements of the sequence are 
considered is one of the simplest, and most common ones. 

See also 

General 

Auxiliary field Monte Carlo 
Bootstrapping (statistics) 
Demon algorithm 
Evolutionary Computation 
Las Vegas algorithm 
Markov chain 
-» Molecular dynamics 
Monte Carlo option model 
-» Monte Carlo integration 
-» Quasi-Monte Carlo method 
Random number generator 
Randomness 
Resampling (statistics) 
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Application areas 

• Graphics, particularly for ray tracing; a version of the Metropolis-Hastings algorithm is 
also used for ray tracing where it is known as Metropolis light transport 
Modeling light transport in biological tissue 
Monte Carlo methods in finance 
Reliability engineering 

In simulated annealing for protein structure prediction 

In semiconductor device research, to model the transport of current carriers 
Environmental science, dealing with contaminant behavior 

Search And Rescue and Counter-Pollution. Models used to predict the drift of a life raft 
or movement of an oil slick at sea. 

In probabilistic design for simulating and understanding the effects of variability 
In physical chemistry, particularly for simulations involving atomic clusters 
In biomolecular simulations 
In polymer physics 

• Bond fluctuation model 
In computer science 

• Las Vegas algorithm 

• LURCH 

• Computer go 

• General Game Playing 

Modeling the movement of impurity atoms (or ions) in plasmas in existing and tokamaks 
(e.g.: DIVIMP). 
Nuclear and particle physics codes using the Monte Carlo method: 

GEANT — CERN's simulation of high energy particles interacting with a detector. 
CompHEP, PYTHIA — Monte-Carlo generators of particle collisions 
MCNP(X) - LANL's radiation transport codes 

MCU: universal computer code for simulation of particle transport (neutrons, photons, 
electrons) in three-dimensional systems by means of the Monte Carlo method 
EGS — Stanford's simulation code for coupled transport of electrons and photons 
PEREGRINE: LLNL's Monte Carlo tool for radiation therapy dose calculations 
BEAMnrc — Monte Carlo code system for modeling radiotherapy sources (LINAC's) 
PENELOPE — Monte Carlo for coupled transport of photons and electrons, with 
applications in radiotherapy 

• MONK — Serco Assurance's code for the calculation of k-effective of nuclear systems 
Modelling of foam and cellular structures 
Modeling of tissue morphogenesis 
Computation of holograms 
Phylogenetic analysis, i.e. Bayesian inference, Markov chain Monte Carlo 
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Other methods employing Monte Carlo 

Assorted random models, e.g. self-organised criticality 

Direct simulation Monte Carlo 

Dynamic Monte Carlo method 

Kinetic Monte Carlo 

-» Quantum Monte Carlo 

-» Quasi-Monte Carlo method using low-discrepancy sequences and self avoiding walks 

Semiconductor charge transport and the like 

Electron microscopy beam-sample interactions 

Stochastic optimization 

Cellular Potts model 

Markov chain Monte Carlo 

Cross-entropy method 

Applied information economics 

Monte Carlo localization 

Notes 

[1] Douglas Hubbard "How to Measure Anything: Finding the Value of Intangibles in Business" pg. 46, John Wiley 

& Sons, 2007 
[2] Douglas Hubbard "The Failure of Risk Management: Why It's Broken and How to Fix It", John Wiley & Sons, 

2009 
[3] Nicholas Metropolis (1987), " The beginning of the Monte Carlo method (http://library.lanl.gov/la-pubs/ 

00326866.pdf)", Los Alamos Science (1987 Special Issue dedicated to Stanislaw Ulam): 125-130, 
[4] Douglas Hubbard "How to Measure Anything: Finding the Value of Intangibles in Business" pg. 46, John Wiley 

& Sons, 2007 
[5] David Vose: "Risk Analysis, A Quantitative Guide," Second Edition, p. 13, John Wiley & Sons, 2000. 
[6] Ibid, p. 16 

[7] Ibid, p. 17, showing graph 

[8] http://www.ipgp.jussieu.fr/~tarantola/Files/Professional/Papers_PDF/MonteCarlo_latex.pdf 
[9] http://www.ipgp.jussieu.fr/~tarantola/Files/Professional/SIAM/index.html 
[10] Davenport, J. H.. "Primality testing revisited" (http://doi.acm.org/10.1145/143242.143290). doi: 

http://doi.acm.org/10.1145/143242.143290 (http://dx.doi.Org/http://doi.acm.org/10.1145/143242. 

143290). . Retrieved on 2007-08-19. 

References 

• Metropolis, N.; Ulam, S. (1949). "The Monte Carlo Method". Journal of the American 
Statistical Association 44 (247): 335-341. doi: 10.2307/2280232 (http://dx.doi.org/10. 
2307/2280232). 

• Metropolis, Nicholas; Rosenbluth, Arianna W.; Rosenbluth, Marshall N.; Teller, Augusta 
H.; Teller, Edward (1953). "Equation of State Calculations by Fast Computing Machines". 
Journal of Chemical Physics 21 (6): 1087. doi: 10.1063/1.1699114 (http://dx.doi.org/ 

10.1063/1.1699114). 

• Hammersley, J. M.; Handscomb, D. C. (1975). Monte Carlo Methods. London: Methuen. 
ISBN 0416523404. 

• Kahneman, D.; Tversky, A. (1982). Judgement under Uncertainty: Heuristics and Biases. 
Cambridge University Press. 

• Gould, Harvey; Tobochnik, Jan (1988). An Introduction to Computer Simulation Methods, 
Part 2, Applications to Physical Systems. Reading: Addison-Wesley. ISBN 020116504X. 



Monte Carlo method 52 

Binder, Kurt (1995). The Monte Carlo Method in Condensed Matter Physics. New York: 

Springer. ISBN 0387543694. 

Berg, Bernd A. (2004). Markov Chain Monte Carlo Simulations and Their Statistical 

Analysis (With Web-Based Fortran Code). Hackensack, NJ: World Scientific. ISBN 

9812389350. 

Caflisch, R. E. (1998). Monte Carlo and quasi-Monte Carlo methods. Acta Numerica. 7. 

Cambridge University Press, pp. 1-49. 

Doucet, Arnaud; Freitas, Nando de; Gordon, Neil (2001). Sequential Monte Carlo 

methods in practice. New York: Springer. ISBN 0387951466. 

Fishman, G. S. (1995). Monte Carlo: Concepts, Algorithms, and Applications. New York: 

Springer. ISBN 038794527X. 

MacKeown, P. Kevin (1997). Stochastic Simulation in Physics. New York: Springer. ISBN 

9813083263. 

Robert, C. P.; Casella, G. (2004). Monte Carlo Statistical Methods (2nd ed.). New York: 

Springer. ISBN 0387212396. 

Rubinstein, R. Y.; Kroese, D. P. (2007). Simulation and the Monte Carlo Method (2nd ed.). 

New York: John Wiley & Sons. ISBN 9780470177938. 

Mosegaard, Klaus; Tarantola, Albert (1995). "Monte Carlo sampling of solutions to 

inverse problems". J. Geophys. Res. 100 (B7): 12431-12447. doi: 10.1029/94JB03097 

(http://dx.doi.org/10.1029/94JB03097). 

Tarantola, Albert (2005). Inverse Problem Theory (http://www.ipgp.jussieu.fr/ 

~tarantola/Files/Professional/SIAM/index.html). Philadelphia: Society for Industrial and 
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External links 

Overview and reference list (http://mathworld.wolfram.com/MonteCarloMethod.html), 

Mathworld 

Introduction to Monte Carlo Methods (http://www.ipp.mpg.de/de/for/bereiche/ 

stellarator/Comp_sci/CompScience/csep/csepl.phy. ornl.gov/mc/mc. htm I), 

Computational Science Education Project 

Overview of formulas used in Monte Carlo simulation (http://www.sitmo.com/eqcat/ 

15), the Quant Equation Archive, at sitmo.com 

The Basics of Monte Carlo Simulations (http://www.chem.unl.edu/zeng/joy/mclab/ 

mcintro.html), University of Nebraska-Lincoln 

Introduction to Monte Carlo simulation (http://office.microsoft.com/en-us/assistance/ 

HA011118931033.aspx) (for Excel), Wayne L. Winston 

Monte Carlo Methods - Overview and Concept (http://www.brighton-webs.co.uk/ 

montecarlo/concept.asp), brighton-webs.co.uk 

Molecular Monte Carlo Intro (http://www.cooper.edu/engineering/chemechem/monte. 

html), Cooper Union 

Monte Carlo techniques applied in physics (http://homepages.ed.ac.uk/s0095122/ 

Appletl-page.htm) 

MonteCarlo Simulation in Finance (http://www.global-derivatives.com/maths/k-o. 

php), global-derivatives.com 

Approximation of n with the Monte Carlo Method (http://twt.mpei.ac.ru/MAS/ 

Worksheets/a pproxpi.mcd) 
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• Risk Analysis in Investment Appraisal (http://papers.ssrn.com/sol3/papers. 
cfm?abstract_id = 265905), The Application of Monte Carlo Methodology in Project 
Appraisal, Sawakis C. Sawides 

• Probabilistic Assessment of Structures using the Monte Carlo method (http://en. 
wikiversity.org/wiki/Probabilistic_Assessment_of_Structures), Wikiuniversity paper for 
students of Structural Engineering 

• A very intuitive and comprehensive introduction to Quasi-Monte Carlo methods (http:// 
www.puc-rio.br/marco.ind/quasi_mc.html) 

• Pricing using Monte Carlo simulation (http://knol.google.eom/k/giancarlo-vercellino/ 
pricing-using-monte-carlo-simulation/lld5i2rgd9gn5/3#), a practical example, Prof. 
Giancarlo Vercellino 

Software 

• The BUGS project (http://www.mrc-bsu.cam.ac.uk/bugs/) (including WinBUGS and 
OpenBUGS) 

• Monte Carlo Simulation, Resampling, Bootstrap tool (http://www.statisticsl01.net) 

• YASAI: Yet Another Simulation Add-In (http://yasai.rutgers.edu/) - Free Monte Carlo 
Simulation Add-In for Excel created by Rutgers University 

Monte Carlo molecular modeling 

Monte Carlo molecular modeling is the application of -» Monte Carlo methods to 
molecular problems. These problems can also be modeled by the -» molecular dynamics 
method. The difference is that this approach relies on -» statistical mechanics rather than 
molecular dynamics. Instead of trying to reproduce the dynamics of a system, it generates 
states according to appropriate Boltzmann probabilities. Thus, it is the application of the 
Metropolis Monte Carlo simulation to molecular systems. It is therefore also a 
particular subset of the more general Monte Carlo method in statistical physics. 

It employs a Markov chain procedure in order to determine a new state for a system from 
a previous one. According to its stochastic nature, this new state is accepted at random. 
Each trial usually counts as a move. The avoidance of dynamics restricts the method to 
studies of static quantities only, but the freedom to choose moves makes the method very 
flexible. These moves must only satisfy a basic condition of balance in order equilibrium be 
properly described, but detailed balance, a stronger condition, is usually imposed when 
designing new algorithms. An additional advantage is that some systems, such as the Ising 
model, lack a dynamical description and are only defined by an energy prescription; for 
these the Monte Carlo approach is the only one feasible. 

The great success of this method in statistical mechanics has led to various generalizations 
such as the method of simulated annealing for optimization, in which a fictitious 
temperature is introduced and then gradually lowered. 
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See also 

• -» Quantum Monte Carlo 

• Monte Carlo method in statistical physics 

• Software for molecular mechanics modeling 

• Bond fluctuation model 

External links 

• http://cmm.cit.nih.gov/intro_simulation/node25.html 

• http://members.aol.com/btluke/metro01.htm 
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Monte Carlo integration 



In mathematics, Monte Carlo integration is 

numerical integration using random numbers. That is, 
Monte Carlo integration methods are algorithms for the 
approximate evaluation of definite integrals, usually 
multidimensional ones. The usual algorithms evaluate 
the integrand at a regular grid. Monte Carlo methods, 
however, randomly choose the points at which the 
integrand is evaluated. 

Informally, to estimate the area of a domain D, first pick 
a simple domain d whose area is easily calculated and 
which contains D. Now pick a sequence of random 
points that fall within d. Some fraction of these points 
will also fall within D. The area of D is then estimated 
as this fraction multiplied by the area of d. 

The traditional Monte Carlo algorithm distributes the 
evaluation points uniformly over the integration region. 
Adaptive algorithms such as VEGAS and MISER use 
importance sampling and stratified sampling 
techniques to get a better result. 
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An illustration of Monte Carlo 

integration. In this example, the 

domain D is the inner circle and the 

domain d is the square. Because the 

square's area can be easily calculated, 

the area of the circle can be estimated 

by the ratio (0.8) of the points inside 

the circle (40) to the total number of 

points (50), yielding an approximation 

for -st/4 FS 0.785 



Plain pseudo-random sampling 

The algorithm computes an estimate of a multidimensional definite integral of the form, 
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1=1 dx! dx 2 ... dx Ti f(x 1 ,x 2 , ... ,x n ) = / f(p)> 



where p = [x±, . . . ,x n ] and the hypercube V is the integration volume, 

V = {p\a-[ < Xi <bi,a,2<X2 < 62, ■ ■ ■ , a n < x n < b n ] . 
The plain Monte Carlo algorithm samples points uniformly from the integration region to 
estimate the integral and its error. Suppose that the sample has size N and denote the 
points in the sample by p , ..., p . Then the estimate for the integral is given by 

jv 1=1 

where {/} denotes the sample mean of the integrand. 
The variance of the function can be estimated using 

w(/)^ = l5>Cft)-</)) a . 

According to the central limit theorem the variance of the estimate of the integral can be 
estimated as 

For large N this variance decreases asymptotically as 1/jV . The error estimate, 



50 sa V /W(Q) = ¥—==. 

decreases as 1/vJV. The familiar law of random walk applies: to reduce the error by a 

factor of 10 requires a 100-fold increase in the number of sample points. 
The above expression provides a statistical estimate of the error on the result. This error 
estimate is not a strict error bound — random sampling of the region may not uncover all 
the important features of the function, resulting in an underestimate of the error. 
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Recursive stratified sampling 

Recursive stratified sampling is a generalization of 
one-dimensional adaptive quadratures to 

multi-dimensional integrals. On each recursion step the 
integral and the error are estimated using a plain 
Monte Carlo algorithm. If the error estimate is larger 
than the required accuracy the integration volume is 
divided into sub-volumes and the procedure is 
recursively applied to sub-volumes. 

The ordinary "dividing by two' strategy does not work 
for multi-dimensions as the number of sub-volumes 
grows way too fast to keep track of. Instead one 
estimates along which dimension a subdivision should 
bring the most dividends and only subdivides the 
volume along this dimension. 

Here is a typical algorithm for recursive stratified 
sampling: 
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Sample A r random points; 

Estimate the average and the error; 

If the error is acceptable : 

Return the average and the error; 
Else : 

For each dimension : 

Subdivide the volume in two along the dimension; 
Estimate the sub-averages in the two sub-volumes; 

Pick the dimension with the largest sub-average; 

Subdivide the volume in two along this dimension; 

Dispatch two recursive calls to each of the sub-volumes; 

Estimate the grand average and grand variance; 

Return the grand average and grand variance; 



An illustration of Recursive Stratified 
Sampling. In this example, the function 

f{x,y) = (x 2 + y 2 <l)n:0 

from the above illustration was 
integrated within a unit sguare using 
the suggested algorithm. The sampled 

Doints were recorded and plotted. 

Clearly stratified sampling algorithm 

concentrates the points in the regions 

where the variation of the function is 

largest. 



The stratified sampling algorithm concentrates the sampling points in the regions where 
the variance of the function is largest thus reducing the grand variance and making the 
sampling more effective, as shown on the illustration. 

The points for the illustration have been generated by the following JavaScript-1.8 
implementation of the above algorithm, 

function strata(f ,a, b,acc,eps,N,aold, void, nold,V) 

{ 

if (typeof (N)=="undef ined" )N=42; // the number of points to be added at 

each recursion 

var randomx = f unction(a, b) [a[i]+Math . random( )*(b[i] -a[i] ) for (i in 

a)] 

var range = function(n) {for(var i=0;i<n;i++) yield i} 

var stats = function(xs){ // statistics 

var xmean=xs . reduce (function (a, b)a+b,0)/xs .length 
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var 
sigma2=xs . reduce (f unction (a, b)a+b*b,0)/xs . length -Math . pow(xmean,2) 
return [xmean,Math . sqrt (sigma2) , xs .length] 

} 
if (typeof (aold)=="undef ined" ){ // first call: setting up 'old' values 
var V=l; for(let k in a) V*=(b[k] -a[k] ) 
let xs=[ randomx(a, b) for(i in range(N))] 
let ys=[f(x) for each (x in xs)] 
var [aold, void, nold]=stats(ys) 

} 
var xs=[ randomx(a, b) for(i in range(N))] // new points 
var ys=[f(x) for each (x in xs)] // new function values 
var [av, va, ]=stats(ys) // average and variance 

var integ=V*(av*N+aold*nold)/(N+nold) // integral and error 
var error=V*Math.sqrt( (va*va*N+vold*vold*nold)/(N+nold)/(N+nold) ) 
if (error<acc+eps*Math .abs(integ) ) return [integ, error] ; // done 
else{ // not done: need to dispatch a recursive call 
var vmax=-l, kmax=0 

for(let k in a){ // look in all dimensions for which is best to 
bisect 

var [al,vl,nl]=stats( [ys[i] for(i in xs)if (xs[i] [k]< 
(a[k]+b[k])/2)]) 

var [ar, vr, nr]=stats( [ys[i] for(i in 
xs)if(xs[i][k]>=(a[k]+b[k])/2)]) 

var v=Math .abs(al-ar) // take the one with largest variation 
if(v>vmax){ // remember the values 
vmax=v; kmax=k; 
var alo=al, vlo=vl, nlo=nl; var aro=ar, vro=vr, nro=nr 

} 
} // now dispatch two recursive calls 
let a2=a . slice( ) ; a2[kmax]=(a[kmax]+b[kmax] )/2 
let b2=b . slice( ) ; b2[kmax]=(a[kmax]+b[kmax] )/2 
let [il,el]=strata(f ,a, b2,acc/l .414,eps,N,alo, vlo, nlo, V/2) 
let [i2,e2]=strata(f ,a2,b,acc/ 1.414, eps,N,aro, vro, n ro, V/2) 
return [il+i2, Math . sqrt (el*el+e2*e2) ] // return results 
} 
} 

var points=[] 
var f un=f unction( [x,y] ){ 
points. push( [x,y] ) 
return x*x+y*y<l ? 1:0 

} 
var a=[-l,-l], b=[l,l] 
var acc=eps=0.5e-2 

var [q,err]=strata(fun,a,b,acc,eps,32) 
print("# m=0, S=l") 
for each(var [x,y] in points) print(x,y) 
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The popular MISER routine implements a similar algorithm. 

MISER Monte Carlo 

The MISER algorithm of Press and Farrar is based on recursive stratified sampling. This 
technique aims to reduce the overall integration error by concentrating integration points 
in the regions of highest variance. 

The idea of stratified sampling begins with the observation that for two disjoint regions a 
and b with Monte Carlo estimates of the integral E a (f) and E b (f) and variances o"~ a {f) and 
ol{f), the variance Var(f) of the combined estimate E(f) = (l/2)(E a (f) + E b (f)) is 
given by, 

Var(/) = K 2 (/)/4JV a ) + (vt(j)jAN b ) 
It can be shown that this variance is minimized by distributing the points such that, 

N*i{Na + N b )=<T a i{o a +<T b ) 

Hence the smallest error estimate is obtained by allocating sample points in proportion to 
the standard deviation of the function in each sub-region. 

The MISER algorithm proceeds by bisecting the integration region along one coordinate 
axis to give two sub-regions at each step. The direction is chosen by examining all d 
possible bisections and selecting the one which will minimize the combined variance of the 
two sub-regions. The variance in the sub-regions is estimated by sampling with a fraction of 
the total number of points available to the current step. The same procedure is then 
repeated recursively for each of the two half-spaces from the best bisection. The remaining 
sample points are allocated to the sub-regions using the formula for N_a and N_b. This 
recursive allocation of integration points continues down to a user-specified depth where 
each sub-region is integrated using a plain Monte Carlo estimate. These individual values 
and their error estimates are then combined upwards to give an overall result and an 
estimate of its error. 

This routines uses the MISER Monte Carlo algorithm to integrate the function f over the 
dim-dimensional hypercubic region defined by the lower and upper limits in the arrays xl 
and xu, each of size dim. The integration uses a fixed number of function calls, and obtains 
random sampling points using the random number generator r. A previously allocated 
workspace s must be supplied. The result of the integration is returned in result, with an 
estimated absolute error abserr. 

Configurable Parameters 

The MISER algorithm has several configurable parameters. 

estimatefrac 

This parameter specifies the fraction of the currently available number of function calls 
which are allocated to estimating the variance at each recursive step. In the GNU Scientific 
Library's (GSL) implementation, the default value is 0.1. 

mincalls 

This parameter specifies the minimum number of function calls required for each estimate 
of the variance. If the number of function calls allocated to the estimate using estimatefrac 
falls below min calls then min calls are used instead. This ensures that each estimate 
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maintains a reasonable level of accuracy. In the GNU Scientific Library's implementation, 
the default value of mincalls is 16 * dim. 

mincallsperbisection 

This parameter specifies the minimum number of function calls required to proceed with a 
bisection step. When a recursive step has fewer calls available than mincallsperbisection 
it performs a plain Monte Carlo estimate of the current sub-region and terminates its 
branch of the recursion. In the GNU Scientific Library's implementation, the default value 
of this parameter is 32 * mincalls. 

alpha 

This parameter controls how the estimated variances for the two sub-regions of a bisection 
are combined when allocating points. With recursive sampling the overall variance should 
scale better than 1/N, since the values from the sub-regions will be obtained using a 
procedure which explicitly minimizes their variance. To accommodate this behavior the 
MISER algorithm allows the total variance to depend on a scaling parameter \alpha, 
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The authors of the original paper describing MISER recommend the value a = 2 as a good 
choice, obtained from numerical experiments, and this is used as the default value in the 
GNU Scientific Library's implementation. 

dither 

This parameter introduces a random fractional variation of size dither into each bisection, 
which can be used to break the symmetry of integrands which are concentrated near the 
exact center of the hypercubic integration region. In the GNU Scientific Library's 
implementation, the default value of dither is zero, so no variation is introduced. If needed, 
a typical value of dither is around 0.1. 

Importance sampling 

VEGAS Monte Carlo 

The VEGAS algorithm of G.P.Lepage is based on importance sampling. It samples points 
from the probability distribution described by the function |/| , so that the points are 
concentrated in the regions that make the largest contribution to the integral. 

In general, if the Monte Carlo integral of / is sampled with points distributed according to 

a probability distribution described by the function 9 , we obtain an estimate E g (f; N) i 

E g (f;N)=E(f/g;N) 

with a corresponding variance, 

Var g (f;N) = Var(f/g;N) 

If the probability distribution is chosen as Q = |/|/J(|/|) then it can be shown that the 
variance Vg\f\ N) vanishes, and the error in the estimate will be zero. In practice it is not 
possible to sample from the exact distribution 9 for an arbitrary function, so importance 
sampling algorithms aim to produce efficient approximations to the desired distribution. 



Monte Carlo integration 60 

The VEGAS algorithm approximates the exact distribution by making a number of passes 
over the integration region while histogramming the function /. Each histogram is used to 
define a sampling distribution for the next pass. Asymptotically this procedure converges to 
the desired distribution. In order to avoid the number of histogram bins growing like K d 
the probability distribution is approximated by a separable function: 
g(xi,x 2 ,. ■ ■) = 51(^1)5-2(^2) ■ ■ ■ so that the number of bins required is only Kd . This is 
equivalent to locating the peaks of the function from the projections of the integrand onto 
the coordinate axes. The efficiency of VEGAS depends on the validity of this assumption. It 
is most efficient when the peaks of the integrand are well-localized. If an integrand can be 
rewritten in a form which is approximately separable this will increase the efficiency of 
integration with VEGAS. 

VEGAS incorporates a number of additional features, and combines both stratified sampling 
and importance sampling. The integration region is divided into a number of "boxes", with 
each box getting a fixed number of points (the goal is 2). Each box can then have a 
fractional number of bins, but if bins/box is less than two, Vegas switches to a kind variance 
reduction (rather than importance sampling). 

This routines uses the VEGAS Monte Carlo algorithm to integrate the function / over the 
dim-dimensional hypercubic region defined by the lower and upper limits in the arrays xl 
and £u , each of size dim . The integration uses a fixed number of function calls calls, and 
obtains random sampling points using the random number generator r . A previously 
allocated workspace * must be supplied. The result of the integration is returned in result 
, with an estimated absolute error abserr . The result and its error estimate are based on a 
weighted average of independent samples. The chi-squared per degree of freedom for the 
weighted average is returned via the state struct component, s — > chisq, and must be 
consistent with 1 for the weighted average to be reliable. 

The VEGAS algorithm computes a number of independent estimates of the integral 
internally, according to the iterations parameter described below, and returns their 
weighted average. Random sampling of the integrand can occasionally produce an estimate 
where the error is zero, particularly if the function is constant in some regions. An estimate 
with zero error causes the weighted average to break down and must be handled 
separately. In the original Fortran implementations of VEGAS the error estimate is made 
non-zero by substituting a small value (typically le-30). The implementation in GSL differs 
from this and avoids the use of an arbitrary constant - it either assigns the value a weight 
which is the average weight of the preceding estimates, or discards it according to the 
following procedure: 

• Current estimate has zero error, weighted average has finite error 

The current estimate is assigned a weight which is the average weight of the preceding 
estimates. 

• Current estimate has finite error, previous estimates had zero error 

The previous estimates are discarded and the weighted averaging procedure begins with 
the current estimate. 

• Current estimate has zero error, previous estimates had zero error 

The estimates are averaged using the arithmetic mean, but no error is computed. 
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Configurable Parameters 

The VEGAS algorithm is configurable. 

chisq 

This parameter gives the chi-squared per degree of freedom for the weighted estimate of 
the integral. The value of chisq should be close to 1. A value of chisq which differs 
significantly from 1 indicates that the values from different iterations are inconsistent. In 
this case the weighted error will be under-estimated, and further iterations of the algorithm 
are needed to obtain reliable results. 

alpha 

The parameter alpha controls the stiffness of the rebinning algorithm. It is typically set 
between one and two. A value of zero prevents rebinning of the grid. In the GNU Scientific 
Library's implementation, the default value is 1.5. 

iterations 

The number of iterations to perform for each call to the routine. In the GNU Scientific 
Library's implementation, the default value is 5 iterations. 

stage 

Setting this determines the stage of the calculation. Normally, stage = which begins with 
a new uniform grid and empty weighted average. Calling vegas with stage = 1 retains the 
grid from the previous run but discards the weighted average, so that one can "tune" the 
grid using a relatively small number of points and then do a large run with stage = 1 on the 
optimized grid. Setting stage = 2 keeps the grid and the weighted average from the 
previous run, but may increase (or decrease) the number of histogram bins in the grid 
depending on the number of calls available. Choosing stage = 3 enters at the main loop, so 
that nothing is changed, and is equivalent to performing additional iterations in a previous 
call. 

mode 

The possible choices are GSLVEGASMODEIMPORTANCE, 

GSLVEGASMODESTRATIFIED, GSLVEGASMODEIMPORTANCEONLY. This 

determines whether VEGAS will use importance sampling or stratified sampling, or 
whether it can pick on its own. In low dimensions VEGAS uses strict stratified sampling 
(more precisely, stratified sampling is chosen if there are fewer than 2 bins per box). 



Monte Carlo integration 62 

See also 

• Auxiliary field Monte Carlo 

• -» Monte Carlo method 

References and further reading 

The following reference about Monte Carlo and quasi-Monte Carlo methods in general (with 
a description of the variance reduction techniques) is excellent to start with: 

• R. E. Caflisch, Monte Carlo and quasi-Monte Carlo methods, Acta Numerica vol. 7, 
Cambridge University Press, 1998, pp. 1-49. 

Nice survey on arXiv, based on lecture for graduate students in high energy physics: 

• S. Weinzierl, Introduction to Monte Carlo methods ' , 
The MISER algorithm is described in the following article, 

• W.H. Press, G.R. Farrar, Recursive Stratified Sampling for Multidimensional Monte Carlo 
Integration, Computers in Physics, v4 (1990), ppl90-195. 

The VEGAS algorithm is described in the following papers, 

• G.P. Lepage, A New Algorithm for Adaptive Multidimensional Integration, Journal of 
Computational Physics 27, 192-203, (1978) 

• G.P. Lepage, VEGAS: An Adaptive Multi-dimensional Integration Program, Cornell 
preprint CLNS 80-447, March 1980 

Early works: 

• J. M. Hammersley, D.C. Handscomb (1964) Monte Carlo Methods. Methuen. ISBN 
0-416-52340-4 

Based on the GNU Scientific Library's manual, which is published under the GFDL (and 
hence free to use for Wikipedia). Original available here . 

External links 

• Module for Monte Carlo Integration 

• Internet Resources for Monte Carlo Integration *■ ' 
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[1] http://arxiv.org/abs/hep-ph/0006269/ 

[2] http://www.gnu.org/software/gsl/manual/gsl-ref_23.html 

[3] http://math.fullerton.edu/mathews/n2003/MonteCarloMod.html 

[4] http://math.fullerton.edu/mathews/n2003/montecarlo/MonteCarloBib/Links/MonteCarloBib_lnk_l.html 
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Quasi-Monte Carlo method 

In numerical analysis, a quasi-Monte Carlo method is a method for the computation of an 
integral (or some other problem) that is based on low-discrepancy sequences. This is in 
contrast to a regular -» Monte Carlo method, which is based on sequences of pseudorandom 
numbers. 

Monte Carlo and quasi-Monte Carlo methods are stated in a similar way. The problem is to 
approximate the integral of a function f as the average of the function evaluated at a set of 
points x y ..., x N . 

1 N 

where I s is the s-dimensional unit cube, I s = [0, 1] x ... x [0, 1]. (Thus each x. is a vector of s 
elements.) In a Monte Carlo method, the set x , ..., x is a subsequence of pseudorandom 
numbers. In a quasi-Monte Carlo method, the set is a subsequence of a low-discrepancy 
sequence. 

The approximation error of a method of the above type is bounded by a term proportional to 
the discrepancy of the set x , ..., x , by the Koksma-Hlawka inequality. The discrepancy of 
sequences typically used for the quasi-Monte Carlo method is bounded by a constant times 

N ' 
In comparison, with probability one, the expected discrepancy of a uniform random 

sequence (as used in the Monte Carlo method) has an order of convergence 

/ log log N 

by the law of the iterated logarithm. 

Thus it would appear that the accuracy of the quasi-Monte Carlo method increases faster 
than that of the Monte Carlo method. However, Morokoff and Caflisch cite examples of 
problems in which the advantage of the quasi-Monte Carlo is less than expected 
theoretically. Still, in the examples studied by Morokoff and Caflisch, the quasi-Monte Carlo 
method did yield a more accurate result than the Monte Carlo method with the same 
number of points. 

Morokoff and Caflisch remark that the advantage of the quasi-Monte Carlo method is 
greater if the integrand is smooth, and the number of dimensions s of the integral is small. 
A technique, coined randomized quasi-Monte Carlo, that mixes quasi-Monte Carlo with 
traditional Monte Carlo, extends the benefits of quasi-Monte Carlo to medium to large s. 
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Application areas 

• Monte Carlo methods in finance 

See also 

• -» Monte Carlo method 
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• R. E. Caflisch, Monte Carlo and quasi-Monte Carlo methods, Acta Numerica vol. 7, 
Cambridge University Press, 1998, pp. 1-49. 

External links 

• A very intuitive and comprehensive introduction to Quasi-Monte Carlo methods 
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Quantum Dynamics 



Quantum thermodynamics 

In the physical sciences, quantum thermodynamics is the study of heat and work 
dynamics in quantum systems. Approximately, quantum thermodynamics attempts to 
combine thermodynamics and quantum mechanics into a coherent whole. The essential 
point at which "quantum mechanics" beqan was when, in 1900, Max Planck outlined the 
"quantum hypothesis", i.e. that the enerqy of atomic systems can be quantized, as based on 
the first two laws of thermodynamics as described by Rudolf Clausius (1865) and Ludwiq 
Boltzmann (1877). ] [ ] See the history of quantum mechanics for a more detailed outline. 

Overview 

A central objective in quantum thermodynamics is the quantitative and qualitative 
determination of the laws of thermodynamics at the quantum level in which uncertainty and 
probability beqin to take effect. A fundamental question is: what remains of 
thermodynamics if one qoes to the extreme limit of small quantum systems havinq a few 
deqrees of freedom? If thermodynamics applies at this level, are the many formulations of 
the second law of thermodynamics, i.e. the entropy of a closed system cannot decrease, 
heat flows from hiqh to low temperature, systems evolve towards minimum potential 
enerqy wells, enerqy tends to dissipate, etc., still applicable, or is there a more "universal" 
formulation? 

See also 

• Quantum decoherence 

References 

[1] Planck, Max. (1900). " Entropy and Temperature of Radiant Heat (http://www.iee.org/publish/inspec/ 

prodcat/1900A01446. xml) ." Annalen der Physick, vol. 1. no 4. April, pg. 719-37. 
[2] Planck, Max. (1901). " On the Law of Distribution of Energy in the Normal Spectrum (http://dbhs.wvusd.kl2. 

ca.us/webdocs/Chem-History/Planck-1901/Planck-1901.html)". Annalen der Physik, vol. 4, p. 553 ff. 

Further reading 

1. Gemmer, J., Michel, M., Mahler, G. (2005). Quantum Thermodynamics - Emergence of 
Thermodynamic Behavior Within Composite Quantum Systems. Sprinqer. ISBN 
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External links 

• Quantum Thermodynamics and the Gibbs Paradox (http://staff.science.uva.nl/ 
~nieuwenh/QL2L.html) 

• Quantum Thermodynamics (http://www.chaos.org.uk/~eddy/physics/heat.html) 

• On the Classical Limit of Quantum Thermodynamics in Finite Time (http://www.fh.huji. 
ac.il/~ronnie/Papers/geva92.pdf) [PDF-format] 

• Quantum Thermodynamics (http://www.quantumthermodynamics.org) - list of good 
related articles 



Quantum chemistry 



Quantum chemistry is a branch of theoretical chemistry, which applies quantum 
mechanics and quantum field theory to address issues and problems in chemistry. The 
description of the electronic behavior of atoms and molecules as pertaining to their 
reactivity is one of the applications of quantum chemistry. Quantum chemistry lies on the 
border between chemistry and physics, and significant contributions have been made by 
scientists from both fields. It has a strong and active overlap with the field of atomic 
physics and molecular physics, as well as physical chemistry. 

Quantum chemistry mathematically describes the fundamental behavior of matter at the 
molecular scale. It is, in principle, possible to describe all chemical systems using this 
theory. In practice, only the simplest chemical systems may realistically be investigated in 
purely quantum mechanical terms, and approximations must be made for most practical 
purposes (e.g., Hartree-Fock, post Hartree-Fock or Density functional theory, see 
computational chemistry for more details). Hence a detailed understanding of quantum 
mechanics is not necessary for most chemistry, as the important implications of the theory 
(principally the orbital approximation) can be understood and applied in simpler terms. 

In quantum mechanics the Hamiltonian, or the physical state, of a particle can be expressed 
as the sum of two operators, one corresponding to kinetic energy and the other to potential 
energy. The Hamiltonian in the Schrodinger wave equation used in quantum chemistry does 
not contain terms for the spin of the electron. 

Solutions of the Schrodinger equation for the hydrogen atom gives the form of the wave 
function for atomic orbitals, and the relative energy of the various orbitals. The orbital 
approximation can be used to understand the other atoms e.g. helium, lithium and carbon. 

History 

The history of quantum chemistry essentially began with the 1838 discovery of cathode 
rays by Michael Faraday, the 1859 statement of the black body radiation problem by Gustav 
Kirchhoff, the 1877 suggestion by Ludwig Boltzmann that the energy states of a physical 
system could be discrete, and the 1900 quantum hypothesis by Max Planck that any energy 
radiating atomic system can theoretically be divided into a number of discrete energy 
elements e such that each of these energy elements is proportional to the frequency v with 
which they each individually radiate energy, as defined by the following formula: 

e = hi/ 
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where h is a numerical value called Planck's Constant. Then, in 1905, to explain the 
photoelectric effect (1839), i.e., that shining light on certain materials can function to eject 
electrons from the material, Albert Einstein postulated, based on Planck's quantum 
hypothesis, that light itself consists of individual quantum particles, which later came to be 
called photons (1926). In the years to follow, this theoretical basis slowly began to be 
applied to chemical structure, reactivity, and bonding. 

Electronic structure 

The first step in solving a quantum chemical problem is usually solving the Schrodinger 
equation (or Dirac equation in relativistic quantum chemistry) with the electronic molecular 
Hamiltonian. This is called determining the electronic structure of the molecule. It can be 
said that the electronic structure of a molecule or crystal implies essentially its chemical 
properties. 

Wave model 

The foundation of quantum mechanics and quantum chemistry is the wave model, in which 
the atom is a small, dense, positively charged nucleus surrounded by electrons. Unlike the 
earlier Bohr model of the atom, however, the wave model describes electrons as "clouds" 
moving in orbitals, and their positions are represented by probability distributions rather 
than discrete points. The strength of this model lies in its predictive power. Specifically, it 
predicts the pattern of chemically similar elements found in the periodic table. The wave 
model is so named because electrons exhibit properties (such as interference) traditionally 
associated with waves. See wave-particle duality. 

Valence bond 

Although the mathematical basis of quantum chemistry had been laid by Schrodinger in 

1926, it is generally accepted that the first true calculation in quantum chemistry was that 
of the German physicists Walter Heitler and Fritz London on the hydrogen (H ) molecule in 

1927. Heitler and London's method was extended by the American theoretical physicist 
John C. Slater and the American theoretical chemist Linus Pauling to become the 
Valence-Bond (VB) [or Heitler-London-Slater-Pauling (HLSP)] method. In this 
method, attention is primarily devoted to the pairwise interactions between atoms, and this 
method therefore correlates closely with classical chemists' drawings of bonds. 

Molecular orbital 

An alternative approach was developed in 1929 by Friedrich Hund and Robert S. Mulliken, 
in which electrons are described by mathematical functions delocalized over an entire 
molecule. The Hund-Mulliken approach or molecular orbital (MO) method is less 
intuitive to chemists, but has turned out capable of predicting spectroscopic properties 
better than the VB method. This approach is the conceptional basis of the Hartree-Fock 
method and further post Hartree-Fock methods. 
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Density functional theory 

The Thomas-Fermi model was developed independently by Thomas and Fermi in 1927. 
This was the first attempt to describe many-electron systems on the basis of electronic 
density instead of wave functions, although it was not very successful in the treatment of 
entire molecules. The method did provide the basis for what is now known as density 
functional theory. Though this method is less developed than post Hartree-Fock methods, 
its lower computational requirements allow it to tackle larger polyatomic molecules and 
even macromolecules, which has made it the most used method in computational chemistry 
at present. 

Chemical dynamics 

A further step can consist of solving the Schrodinger equation with the total molecular 
Hamiltonian in order to study the motion of molecules. Direct solution of the Schrodinger 
equation is called quantum molecular dynamics, within the semiclassical approximation 
semiclassical molecular dynamics, and within the classical mechanics framework -» 
molecular dynamics (MD). Statistical approaches, using for example -» Monte Carlo 
methods, are also possible. 

Adiabatic chemical dynamics 

Main article: Adiabatic formalism or Born-Oppenheimer approximation 

In adiabatic dynamics, interatomic interactions are represented by single scalar 
potentials called potential energy surfaces. This is the Born-Oppenheimer approximation 
introduced by Born and Oppenheimer in 1927. Pioneering applications of this in chemistry 
were performed by Rice and Ramsperger in 1927 and Kassel in 1928, and generalized into 
the RRKM theory in 1952 by Marcus who took the transition state theory developed by 
Eyring in 1935 into account. These methods enable simple estimates of unimolecular 
reaction rates from a few characteristics of the potential surface. 

Non-adiabatic chemical dynamics 

Non-adiabatic dynamics consists of taking the interaction between several coupled 
potential energy surface (corresponding to different electronic quantum states of the 
molecule). The coupling terms are called vibronic couplings. The pioneering work in this 
field was done by Stueckelberg, Landau, and Zener in the 1930s, in their work on what is 
now known as the Landau-Zener transition. Their formula allows the transition probability 
between two diabatic potential curves in the neighborhood of an avoided crossing to be 
calculated. 

Quantum chemistry and quantum field theory 

The application of quantum field theory (QFT) to chemical systems and theories has become 
increasingly common in the modern physical sciences. One of the first and most 
fundamentally explicit appearances of this is seen in the theory of the photomagneton. In 
this system, plasmas, which are ubiquitous in both physics and chemistry, are studied in 
order to determine the basic quantization of the underlying bosonic field. However, 
quantum field theory is of interest in many fields of chemistry, including: nuclear chemistry, 
astrochemistry, sonochemistry, and quantum hydrodynamics. Field theoretic methods have 
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also been critical in developing the ab initio Effective Hamiltonian theory of semi-empirical 
pi-electron methods. 

See also 

Atomic physics 

Computational chemistry 

Condensed matter physics 

International Academy of Quantum Molecular Science 

Physical chemistry 

Quantum chemistry computer programs 

Quantum electrochemistry 

QMC@Home 

Theoretical physics 

Further reading 

• Pauling, L. (1954). General Chemistry. Dover Publications. ISBN 0-486-65622-5. 

• Pauling, L., and Wilson, E. B. Introduction to Quantum Mechanics with Applications to 
Chemistry (Dover Publications) ISBN 0-486-64871-0 

• Atkins, P.W. Physical Chemistry (Oxford University Press) ISBN 0-19-879285-9 

• McWeeny, R. Coulson's Valence (Oxford Science Publications) ISBN 0-19-855144-4 

• Landau, L.D. and Lifshitz, E.M. Quantum Mechanics-.Non-relativistic Theory (Course of 
Theoretical Physics vol.3) (Pergamon Press) 

• Bernard Pullman and Alberte Pullman. 1963. Quantum Biochemistry., New York and 
London: Academic Press. 

• Eric R. Scerri, The Periodic Table: Its Story and Its Significance, Oxford University Press, 
2006. Considers the extent to which chemistry and especially the periodic system has 
been reduced to quantum mechanics. ISBN 0-19-530573-6. 

• Simon, Z. 1976. Quantum Biochemistry and Specific Interactions., Taylor & Francis; 
ISBN-13: 978-0856260872 and ISBN 0-85-6260878 . 
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Nobel lectures by quantum chemists 

• Walter Kohn's Nobel lecture (http://nobelprize.org/chemistry/laureates/1998/ 
kohn-lecture.html) 

• Rudolph Marcus' Nobel lecture (http://nobelprize.org/chemistry/laureates/1992/ 
marcus-lecture.html) 

• Robert Mulliken's Nobel lecture (http://nobelprize.org/chemistry/laureates/1966/ 
mulliken-lecture.html) 

• Linus Pauling's Nobel lecture (http://nobelprize.org/chemistry/laureates/1954/ 
pauling-lecture.html) 

• John Pople's Nobel lecture (http://nobelprize.org/chemistry/laureates/1998/ 
pople-lecture.html) 
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Electronic structure methods 

Tight binding 



Nearly-free electron model 
Hartree-Fock 



Modern valence bond 
Generalized valence bond 



Moller-Plesset perturbation theory 
Configuration interaction 



Coupled cluster 
Multi-configurational self-consistent field 



Density functional theory 
Quantum chemistry composite methods 



-» Quantum Monte Carlo 
k-p perturbation theory 



Muffin-tin approximation 
LCAO method 



Quantum Monte Carlo is a large class of computer algorithms that simulate quantum 
systems with the idea of solving the many-body problem. They use, in one way or another, 
the -» Monte Carlo method to handle the many-dimensional integrals that arise. Quantum 
Monte Carlo allows a direct representation of many-body effects in the wavefunction, at the 
cost of statistical uncertainty that can be reduced with more simulation time. For bosons, 
there exist numerically exact and polynomial-scaling algorithms. For fermions, there exist 
very good approximations and numerically exact exponentially scaling quantum Monte 
Carlo algorithms, but none that are both. 
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Background 

In principle, any physical system can be described by the many-body Schrodinger equation 
as long as the constituent particles are not moving "too" fast; that is, they are not moving 
near the speed of light. This includes the electrons in almost every material in the world, so 
if we could solve the Schrodinger equation, we could predict the behavior of any electronic 
system, which has important applications in fields from computers to biology. This also 
includes the nuclei in Bose-Einstein condensate and superfluids such as liquid helium. The 
difficulty is that the Schrodinger equation involves a function of three times the number of 
particles and is difficult to solve even using parallel computing technology in a reasonable 
amount of time (less than 2 years). Traditionally, theorists have approximated the 
many-body wave function as an antisymmetric function of one-body orbitals, as shown 
concisely at this link. This kind of formulation either limits the possible wave functions, as 
in the case of the Hartree-Fock (HF) approximation, or converges very slowly, as in 
configuration interaction. One of the reasons for the difficulty with an HF initial estimate 
(ground state seed, also known as Slater determinant) is that it is very difficult to model the 
electronic and nuclear cusps in the wavefunction. However, one does not generally model 
at this point of the approximation. As two particles approach each other, the wavefunction 
has exactly known derivatives. 

Quantum Monte Carlo is a way around these problems because it allows us to model a 
many-body wavefunction of our choice directly. Specifically, we can use a Hartree-Fock 
approximation as our starting point but then multiplying it by any symmetric function, of 
which Jastrow functions are typical, designed to enforce the cusp conditions. Most methods 
aim at computing the ground-state wavefunction of the system, with the exception of path 
integral Monte Carlo and finite-temperature auxiliary field Monte Carlo, which calculate the 
density matrix. 

There are several quantum Monte Carlo methods, each of which uses Monte Carlo in 
different ways to solve the many-body problem: 

Quantum Monte Carlo methods 

• Variational Monte Carlo : A good place to start; it is commonly used in many sorts of 
quantum problems. 

• Diffusion Monte Carlo : The most common high-accuracy method for electrons (that is, 
chemical problems), since it comes quite close to the exact ground-state energy fairly 
efficiently. Also used for simulating the quantum behavior of atoms, etc. 

• Path integral Monte Carlo : Finite-temperature technique mostly applied to bosons where 
temperature is very important, especially superfluid helium. 

• Auxiliary field Monte Carlo : Usually applied to lattice problems, although there has been 
recent work on applying it to electrons in chemical systems. 

• Reptation Monte Carlo : Recent zero-temperature method related to path integral Monte 
Carlo, with applications similar to diffusion Monte Carlo but with some different 
tradeoffs. 

• Gaussian quantum Monte Carlo 
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See also 

-» Monte Carlo method 

QMC@Home 

-» Quantum chemistry 

Density matrix renormalization group 

Time-evolving block decimation 

Metropolis algorithm 

Wavefunction optimization 
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A force field, sometimes known as an energy shield, force shield, or deflector shield is 

a barrier, typically made of energy or charged particles, that protects a person, area or 
object from attacks or intrusions. Force fields tend to appear often in works of speculative 
fiction. 

In real life 

A University of Washington in Seattle group has been experimenting with using a bubble of 
charged plasma to surround a spacecraft, contained by a fine mesh of superconducting 
wire. This would protect the spacecraft from interstellar radiation and some particles 
without needing physical shielding. 

Likewise, Rutherford Appleton Laboratory is attempting to design an actual test satellite, 
which should orbit Earth with a charged plasma field around it. 

Workers at a 3M factory in South Carolina have encountered electrostatically-charged air 
that impeded movement, with the problem fixed by properly grounding the equipment 
causing the stray charges. [ ] 

Plasma windows have some similarities to force fields, being difficult for matter to 
transverse. 

In fiction 

Science fiction and fantasy venues postulate a number of potential uses for force fields: 

• A barrier to allow workers to work in areas that can be exposed to the vacuum of space, 
keeping the atmosphere inside while allowing certain other objects to pass through. 

• Emergency quarantine of an area afflicted by a harmful biological or chemical agent or 
occupied by enemy forces. 

• The extinguishing of a fire by forcing the reaction to use up all the available oxygen in 
the confined space. 

• As a shield (in some cases large enough to cover an entire planet) to protect something 
from damage by natural forces or enemy attack. 

• To create a temporary habitable space in a place not usually suited to sustaining life. 

• As a security method to direct someone in a particular direction for capture, or to confine 
a captive in a particular area. 

The concept goes back at least as far as the 1920s, in the works of E.E. 'Doc' Smith and 
others; and William Hope Hodgson's The Night Land (1912) has the Last Redoubt, in which 
the remnants of humanity shelter, protected by something very like a force field. 

The abilities and exact functionality of energy shields vary; in some works (such as in the 
Star Trek universe), energy shields can stop both energy and particle weapons (e.g. 
phasers) and normal projectiles, both natural and artificial; in others, such as the Star Wars 
universe, there are multiple types of force fields that defend against different sorts of 
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attacks. 

Energy shields usually work by absorbing or dissipating the energy of the incoming attacks- 
prolonged exposure to such attacks weakens the shield and eventually results in the 
shield's collapse, making the ship's hull (or building's walls, or planet's surface) vulnerable 
to attack. An example of this is the aerial Serin Battleships in Command and Conquer 3: 
Tiberium Wars. Larger energy shield systems, or those powered by bigger energy sources, 
can absorb/dissipate more damage before failing -- so that larger starships, for example, 
can mount much stronger shields than a small, single-person starfighter, much in the way 
that a seagoing battleship has much thicker armor than a tiny patrol boat. In some 
instances, shields are actually able to incorporate at least some of the projectile's kinetic 
energy into themselves, making the shield stronger with each hit. 

In some works, shields are completely invulnerable to all technology of the time, yet can 
only be operated for a limited period of time, or at a great expense of energy. This is often 
used in games to give the player temporary invulnerability (for example, the "Iron Curtain" 
used by the Soviet Union in Command & Conquer: Red Alert 2). In other examples, shields 
are invulnerable, yet they are unreliable, meaning they can't block everything. One such 
example is Haegemonia, where the basic deflector shields deployed in the late stages of the 
Earth-Mars civil war were capable of blocking projectiles with an average chance of 30% 
(meaning shots went right through the shield 70% of the time). This ratio was later 
improved, yet these shields weren't completely invulnerable; in addition, although the most 
advanced shields used at the beginning of the Human-Kariak-Darzok war boasted an 
impressive 80% blocking ratio, they were largely ineffective (only about 40%) against 
missile weapons, a lethal liability on planetary shields. 

Very rarely, weapons designed specifically against shields also appeared (such as the 
"phasing cannon" in X-COM: Interceptor: it was designed to rapidly deplete the target 
craft's shields so that it can be finished off or disabled quickly). These pieces of equipment 
were designed to shut down or otherwise weaken the targeted ship/building/planet's shields 
so that other weapons can be brought to bear on the vulnerable hull/walls/planetary 
surface. An even rarer type of anti-shield weaponry include projectiles that are capable of 
penetrating the shield itself (notably the "Shield Breaker" round designed for use in the 
Exitus sniper rifle, which is in turn used by Vindicare Assassins in Warhammer 40,000, 
often against individuals using personal shields). 

Examples 

• The ability to create a force field is a popular superpower in comic books and associated 
media. While only a few characters have the explicit ability to create force fields (for 
example, the Invisible Woman and Violet Parr), many can emulate it with other powers, 
such as Green Lantern's energy constructs, Jean Grey's telekinesis, and Magneto's 
manipulation of electromagnetic fields. 

• In the various Star Trek series, shields that function in an unexplained manner serve as a 
primary protection against weapons fire from enemy ships. Also, inside ships, force field 
generators can seal off ship atmosphere from the outside vacuum, in the case when a 
photon torpedo or phaser beam penetrates the ship's hull resulting in loose items and 
people being blown out to a slow and silent death. 

• In the Star Wars universe, deflector shields are standard issue on most ships and perform 
a function similar to that of their Star Trek counterparts. Unlike Star Trek, multiple kinds 
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of shielding are required to guard against different weapons and interstellar debris. 

• In the popular RTS Starcraft, all Protoss units and buildings are surrounding by 
slowly-regenerating psionic force fields, which will dissipate after taking a set amount of 
damage, allowing the health of the unit to be attacked directly. 

• In Neon Genesis Evangelion, AT fields are explained to be simultaneously the defensive 
barrier generated by the Angels and Evangelions and the psychological barrier that 
separates the identities of different people, with the only difference between the two 
types being that the Angels and Evangelions can project it outwards. 

• In the Dune universe, personal shields have been deployed that can stop objects with 
high kinetic energy. As well, when struck by an energy weapon, the subsequent reaction 
makes both the shield and emitter explode on a nuclear scale. In response to this, many 
have returned to using melee weapons, slowed before strike to penetrate the shield. 
Shields cannot be used on Arrakis as they attract nearby sandworms and drive them into 
killing frenzy. The Fremen, who are used to acting without the slow-down necessitated by 
the shields, take advantage of this to hurt their enemies with devastating efficiency. 

• In Isaac Asimov's Foundation universe, personal shields have been developed by 
scientists specialising in miniaturisation. Used by Traders, they are unknown to other 
inhabitants of the Galactic Empire 

• In the Halo Series, shielding is used by the main character's MJOLNIR powered assault 
armour, made by the UNSC from reverse-engineered alien technology. It can protect 
against projectile weapons to a large degreee, yet not so well against plasma weaponry. 
Hits will strain the shield and eventually deplete it, but if the wearer can stay out of the 
line of fire for a while, the shield will recharge back to full strength. Additionally, shields 
can be overcharged to absorb up to 300% damage compared to normal operating mode, 
at the expense that an overloaded shield can't recharge until fully depleted. 

• In Sins of a Solar Empire, shields protect against all damage, but take an equivalent 
amount of damage. They constantly regenerate. Shields can be bypassed with Phase 
Missiles, tech which is exclusive to the Vasari faction. Shields, like Phase Drives, are 
standard equipment on all combat ships bigger than a fighter. Shields can be upgraded 
up to four times. 

• In Outlaw Star, several characters are equipped with personal Light Shields, which 
absorb all but the most powerful weapons fire, but have limited power and can only be 
operated for seconds or minutes. The shield generator can also be overloaded and 
destroyed if the shield sustains too much damage. 
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Computational Physics and 

Sciences 

Computational Science 

1. REDIRECT Computational science 

This is a redirect from a title with another method of capitalisation. It leads to the title in 
accordance with the Wikipedia naming conventions for capitalisation, and can help writing, 
searching, and international language issues. 

Pages linking to any of these redirects may be updated to link directly to the target page. 
However, do not replace these redirected links with a piped link unless the page is updated 
for another reason. 

For more information, see Category: Redirects from other capitalisations. 

Computational Chemistry List 

The Computation Chemistry List (CCL) was established on January 11 1991, as an 
independent electronic forum for chemistry researchers and educators from around the 
world. According to the forum's web site, it is estimated that more than 3000 members in 
more than 50 countries are reading CCL messages regularly, and the discussions cover all 
aspects of computational chemistry. The list is widely supported and used by the 
computational chemistry community. The list also hosts many resources on 

computational chemistry. For example, it hosted a pre-publication version of Computational 
Chemistry by David Young. 
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Mathematical model 

Note: The term model has a different meaning in model theory, a branch of 
mathematical logic. An artifact which is used to illustrate a mathematical idea is also 
called a mathematical model and this usage is the reverse of the sense explained 
below. 

A mathematical model uses mathematical language to describe a system. Mathematical 
models are used not only in the natural sciences and engineering disciplines (such as 
physics, biology, earth science, meteorology, and engineering) but also in the social 
sciences (such as economics, psychology, sociology and political science); physicists, 
engineers, computer scientists, and economists use mathematical models most extensively. 

Eykhoff (1974) defined a mathematical model as 'a representation of the essential aspects 
of an existing system (or a system to be constructed) which presents knowledge of that 
system in usable form'. ] 

Mathematical models can take many forms, including but not limited to dynamical systems, 
statistical models, differential equations, or game theoretic models. These and other types 
of models can overlap, with a given model involving a variety of abstract structures. 

Examples of mathematical models 

• Population Growth. A simple (though approximate) model of population growth is the 
Malthusian growth model. A slightly more realistic and largely used population growth 
model is the logistic function, and its extensions. 

• Model of a particle in a potential-field . In this model we consider a particle as being a 
point of mass m which describes a trajectory which is modeled by a function x : R -» R 
given its coordinates in space as a function of time. The potential field is given by a 
function V:R -» R and the trajectory is a solution of the differential equation 

(f 
m-^x{t) = -&Bd{V){x{t)). 

Note this model assumes the particle is a point mass, which is certainly known to be 
false in many cases we use this model, for example, as a model of planetary motion. 

• Model of rational behavior for a consumer. In this model we assume a consumer faces a 
choice of n commodities labeled l,2,...,n each with a market price p., p„,..., p . The 
consumer is assumed to have a cardinal utility function U (cardinal in the sense that it 
assigns numerical values to utilities), depending on the amounts of commodities x , x ,..., 
x consumed. The model further assumes that the consumer has a budget M which she 
uses to purchase a vector x , x ,..., x in such a way as to maximize U(x , x ,..., x ). The 
problem of rational behavior in this model then becomes an optimization problem, that is: 

msxU{x 1 ,X2, . . . ,x n ) 
subject to: 



^PiXi < M. 



i=i 



x { > Vi £ {l,2,...,7i} 
This model has been used in general equilibrium theory, particularly to show existence 
and Pareto optimality of economic equilibria. However, the fact that this particular 
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formulation assigns numerical values to levels of satisfaction is the source of criticism 
(and even ridicule). However, it is not an essential ingredient of the theory and again 
this is an idealization. 

• Neighbour-sensing model explains the mushroom formation from the initially chaotic 
fungal network. 

Modeling contains selecting and identifying relevant aspects of a situation in real world. 

Background 

Often when engineers analyze a system to be controlled or optimized, they use a 
mathematical model. In analysis, engineers can build a descriptive model of the system as a 
hypothesis of how the system could work, or try to estimate how an unforeseeable event 
could affect the system. Similarly, in control of a system, engineers can try out different 
control approaches in simulations. 

A mathematical model usually describes a system by a set of variables and a set of 
equations that establish relationships between the variables. The values of the variables 
can be practically anything; real or integer numbers, boolean values or strings, for example. 
The variables represent some properties of the system, for example, measured system 
outputs often in the form of signals, timing data, counters, and event occurrence (yes/no). 
The actual model is the set of functions that describe the relations between the different 
variables. 

Building blocks 

There are six basic groups of variables: decision variables, input variables, state variables, 
exogenous variables, random variables, and output variables. Since there can be many 
variables of each type, the variables are generally represented by vectors. 

Decision variables are sometimes known as independent variables. Exogenous variables are 
sometimes known as parameters or constants. The variables are not independent of each 
other as the state variables are dependent on the decision, input, random, and exogenous 
variables. Furthermore, the output variables are dependent on the state of the system 
(represented by the state variables). 

Objectives and constraints of the system and its users can be represented as functions of 
the output variables or state variables. The objective functions will depend on the 
perspective of the model's user. Depending on the context, an objective function is also 
known as an index of performance, as it is some measure of interest to the user. Although 
there is no limit to the number of objective functions and constraints a model can have, 
using or optimizing the model becomes more involved (computationally). 

Classifying mathematical models 

Many mathematical models can be classified in some of the following ways: 

1. Linear vs. nonlinear: Mathematical models are usually composed by variables, which 
are abstractions of quantities of interest in the described systems, and operators that act 
on these variables, which can be algebraic operators, functions, differential operators, 
etc. If all the operators in a mathematical model present linearity, the resulting 
mathematical model is defined as linear. A model is considered to be nonlinear 
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otherwise. 

The question of linearity and nonlinearity is dependent on context, and linear models may 
have nonlinear expressions in them. For example, in a statistical linear model, it is 
assumed that a relationship is linear in the parameters, but it may be nonlinear in the 
predictor variables. Similarly, a differential equation is said to be linear if it can be 
written with linear differential operators, but it can still have nonlinear expressions in it. 
In a mathematical proqramminq model, if the objective functions and constraints are 
represented entirely by linear equations, then the model is reqarded as a linear model. If 
one or more of the objective functions or constraints are represented with a nonlinear 
equation, then the model is known as a nonlinear model. 

Nonlinearity, even in fairly simple systems, is often associated with phenomena such as 
chaos and irreversibility. Althouqh there are exceptions, nonlinear systems and models 
tend to be more difficult to study than linear ones. A common approach to nonlinear 
problems is linearization, but this can be problematic if one is tryinq to study aspects 
such as irreversibility, which are stronqly tied to nonlinearity. 

2. Deterministic vs. probabilistic (stochastic): A deterministic model is one in which 
every set of variable states is uniquely determined by parameters in the model and by 
sets of previous states of these variables. Therefore, deterministic models perform the 
same way for a qiven set of initial conditions. Conversely, in a stochastic model, 
randomness is present, and variable states are not described by unique values, but rather 
by probability distributions. 

3. Static vs. dynamic: A static model does not account for the element of time, while a 
dynamic model does. Dynamic models typically are represented with difference equations 
or differential equations. 

4. Lumped vs. distributed parameters: If the model is homoqeneous (consistent state 
throuqhout the entire system) the parameters are distributed. If the model is 
heteroqeneous (varyinq state within the system), then the parameters are lumped. 
Distributed parameters are typically represented with partial differential equations. 

A priori information 

Mathematical modelinq problems are often classified into black box or white box models, 
accordinq to how much a priori information is available of the system. A black-box model is 
a system of which there is no a priori information available. A white-box model (also called 
qlass box or clear box) is a system where all necessary information is available. Practically 
all systems are somewhere between the black-box and white-box models, so this concept 
only works as an intuitive quide for approach. 

Usually it is preferable to use as much a priori information as possible to make the model 
more accurate. Therefore the white-box models are usually considered easier, because if 
you have used the information correctly, then the model will behave correctly. Often the a 
priori information comes in forms of knowinq the type of functions relatinq different 
variables. For example, if we make a model of how a medicine works in a human system, we 
know that usually the amount of medicine in the blood is an exponentially decayinq 
function. But we are still left with several unknown parameters; how rapidly does the 
medicine amount decay, and what is the initial amount of medicine in blood? This example 
is therefore not a completely white-box model. These parameters have to be estimated 
throuqh some means before one can use the model. 
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In black-box models one tries to estimate both the functional form of relations between 
variables and the numerical parameters in those functions. Using a priori information we 
could end up, for example, with a set of functions that probably could describe the system 
adequately. If there is no a priori information we would try to use functions as general as 
possible to cover all different models. An often used approach for black-box models are 
neural networks which usually do not make assumptions about incoming data. The problem 
with using a large set of functions to describe a system is that estimating the parameters 
becomes increasingly difficult when the amount of parameters (and different types of 
functions) increases. 

Subjective information 

Sometimes it is useful to incorporate subjective information into a mathematical model. 
This can be done based on intuition, experience, or expert opinion, or based on convenience 
of mathematical form. Bayesian statistics provides a theoretical framework for 
incorporating such subjectivity into a rigorous analysis: one specifies a prior probability 
distribution (which can be subjective) and then updates this distribution based on empirical 
data. An example of when such approach would be necessary is a situation in which an 
experimenter bends a coin slightly and tosses it once, recording whether it comes up heads, 
and is then given the task of predicting the probability that the next flip comes up heads. 
After bending the coin, the true probability that the coin will come up heads is unknown, so 
the experimenter would need to make an arbitrary decision (perhaps by looking at the 
shape of the coin) about what prior distribution to use. Incorporation of the subjective 
information is necessary in this case to get an accurate prediction of the probability, since 
otherwise one would guess 1 or as the probability of the next flip being heads, which 
would be almost certainly wrong. 

Complexity 

In general, model complexity involves a trade-off between simplicity and accuracy of the 
model. Occam's Razor is a principle particularly relevant to modeling; the essential idea 
being that among models with roughly equal predictive power, the simplest one is the most 
desirable. While added complexity usually improves the fit of a model, it can make the 
model difficult to understand and work with, and can also pose computational problems, 
including Numerical instability. Thomas Kuhn argues that as science progresses, 
explanations tend to become more complex before a Paradigm shift offers radical 
simplification. 

For example, when modeling the flight of an aircraft, we could embed each mechanical part 
of the aircraft into our model and would thus acquire an almost white-box model of the 
system. However, the computational cost of adding such a huge amount of detail would 
effectively inhibit the usage of such a model. Additionally, the uncertainty would increase 
due to an overly complex system, because each separate part induces some amount of 
variance into the model. It is therefore usually appropriate to make some approximations to 
reduce the model to a sensible size. Engineers often can accept some approximations in 
order to get a more robust and simple model. For example Newton's classical mechanics is 
an approximated model of the real world. Still, Newton's model is quite sufficient for most 
ordinary-life situations, that is, as long as particle speeds are well below the speed of light, 
and we study macro-particles only. 



Mathematical model 82 

Training 

Any model which is not pure white-box contains some parameters that can be used to fit the 
model to the system it shall describe. If the modelling is done by a neural network, the 
optimization of parameters is called training. In more conventional modelling through 
explicitly given mathematical functions, parameters are determined by curve fitting. 

Model evaluation 

A crucial part of the modeling process is the evaluation of whether or not a given 
mathematical model describes a system accurately. This question can be difficult to answer 
as it involves several different types of evaluation. 

Fit to empirical data 

Usually the easiest part of model evaluation is checking whether a model fits experimental 
measurements or other empirical data. In models with parameters, a common approach to 
test this fit is to split the data into two disjoint subsets: training data and verification data. 
The training data are used to estimate the model parameters. An accurate model will 
closely match the verification data even though this data was not used to set the model's 
parameters. This practice is referred to as cross-validation in statistics. 

Defining a metric to measure distances between observed and predicted data is a useful 
tool of assessing model fit. In statistics, decision theory, and some economic models, a loss 
function plays a similar role. 

While it is rather straightforward to test the appropriateness of parameters, it can be more 
difficult to test the validity of the general mathematical form of a model. In general, more 
mathematical tools have been developed to test the fit of statistical models than models 
involving Differential equations. Tools from nonparametric statistics can sometimes be used 
to evaluate how well data fits a known distribution or to come up with a general model that 
makes only minimal assumptions about the model's mathematical form. 

Scope of the model 

Assessing the scope of a model, that is, determining what situations the model is applicable 
to, can be less straightforward. If the model was constructed based on a set of data, one 
must determine for what systems or situations the data is a typical set of data from. 

The question of whether the model describes well the properties of the system between 
data points is called interpolation, and the same question for events or data points outside 
the observed data is called extrapolation. 

As an example of the typical limitations of the scope of a model, in evaluating Newtonian 
classical mechanics, we can note that Newton made his measurements without advanced 
equipment, so he could not measure properties of particles travelling at speeds close to the 
speed of light. Likewise, he did not measure the movements of molecules and other small 
particles, but macro particles only. It is then not surprising that his model does not 
extrapolate well into these domains, even though his model is quite sufficient for ordinary 
life physics. 
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Philosophical considerations 

Many types of modeling implicitly involve claims about causality. This is usually (but not 
always) true of models involving differential equations. As the purpose of modeling is to 
increase our understanding of the world, the validity of a model rests not only on its fit to 
empirical observations, but also on its ability to extrapolate to situations or data beyond 
those originally described in the model. One can argue that a model is worthless unless it 
provides some insight which goes beyond what is already known from direct investigation 
of the phenomenon being studied. 

An example of such criticism is the argument that the mathematical models of Optimal 
foraging theory do not offer insight that goes beyond the common-sense conclusions of 
evolution and other basic principles of ecology. . 

See also 
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Mathematical chemistry 

Mathematical chemistry is the area of research engaged in the novel and nontrivial 
applications of mathematics to chemistry; it concerns itself principally with the 
mathematical modeling of chemical phenomena. Mathematical chemistry has also 
sometimes been called computer chemistry, but should not be confused with 
computational chemistry. 

Major areas of research in mathematical chemistry include chemical graph theory, which 
deals with topics such as the mathematical study of isomerism and the development of 
topological descriptors or indices which find application in quantitative structure-property 
relationships; chemical aspects of group theory, which finds applications in stereochemistry 
and -» quantum chemistry; and topological aspects of chemistry. 

The history of the approach may be traced back into 18th century. Georg Helm published a 
treatise titled "The Principles of Mathematical Chemistry: The Energetics of Chemical 

T21 

Phenomena" in 1894 . Some of the more contemporary periodical publications 
specializing in the field are MATCH Communications in Mathematical and in Computer 
Chemistry, first published in 1975, and the Journal of Mathematical Chemistry, first 
published in 1987. 

The basic models for mathematical chemistry are molecular graph and topological index. 

See also 
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• Computational chemistry 

• Combinatorial chemistry 

• Molecular modeling 
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Molecular graphics (MG) is the discipline and philosophy of studying molecules and their 
properties through graphical representation. IUPAC limits the definition to 
representations on a "graphical display device". Ever since Dalton's atoms and Kekule's 
benzene, there has been a rich history of hand-drawn atoms and molecules, and these 
representations have had an important influence on modern molecular graphics. This 
article concentrates on the use of computers to create molecular graphics. Note, however, 
that many molecular graphics programs and systems have close coupling between the 
graphics and editing commands or calculations such as in -» molecular modelling. 



Relation to molecular models 

There has been a long tradition of creating 
molecular models from physical materials. 
Perhaps the best known is Crick and 
Watson's model of DNA built from rods and 
planar sheets, but the most widely used 
approach is to represent all atoms and 
bonds explicitly using the "ball and stick" 
approach. This can demonstrate a wide 
range of properties, such as shape, relative 
size, and flexibility. Many chemistry 
courses expect that students will have 
access to ball and stick models. One goal of 
mainstream molecular graphics has been to 
represent the "ball and stick" model as 
realistically as possible and to couple this 
with calculations of molecular properties. 

Figure 1 shows a small molecule (NH CH CH C(OH)(P0 3 H)(P0 3 H)-), as drawn by the Jmol 
program. It is important to realise that the colours are purely a convention. Molecules can 
never be visible under any light microscope and atoms are not coloured, do not have hard 
surfaces and do not reflect light. Bonds are not rod-shaped. If physical molecular models 
had not existed, it is unlikely that molecular graphics would currently use this metaphor. 




Fig. 1. Key: Hydrogen = white, carbon = grey, 

nitrogen = blue, oxygen = red, and phosphorus ■ 

orange. 
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Comparison of physical models with molecular graphics 

Physical models and computer models have partially complementary strengths and 
weaknesses. Physical models can be used by those without access to a computer and now 
can be made cheaply out of plastic materials. Their tactile and visual aspects cannot be 
easily reproduced by computers (although haptic devices have occasionally been built). On 
a computer screen, the flexibility of molecules is also difficult to appreciate; illustrating the 
pseudorotation of cyclohexane is a good example of the value of mechanical models. 

However, it is difficult to build large physical molecules, and all-atom physical models of 
even simple proteins could take weeks or months to build. Moreover, physical models are 
not robust and they decay over time. Molecular graphics is particularly valuable for 
representing global and local properties of molecules, such as electrostatic potential. 
Graphics can also be animated to represent molecular processes and chemical reactions, a 
feat that is not easy to reproduce physically. 

History 

Initially the rendering was on early CRT screens or through plotters drawing on paper. 
Molecular structures have always been an attractive choice for developing new computer 
graphics tools, since the input data are easy to create and the results are usually highly 
appealing. The first example of MG was a display of a protein molecule (Project MAC, 1966) 
by Cyrus Levin thai and Robert Langridge. Among the milestones in high-performance MG 
was the work of Nelson Max in "realistic" rendering of macromolecules using reflecting 
spheres. 

By about 1980 many laboratories both in academia and industry had recognized the power 
of the computer to analyse and predict the properties of molecules, especially in materials 
science and the pharmaceutical industry. The discipline was often called "molecular 
graphics" and in 1982 a group of academics and industrialists in the UK set up the 
Molecular Graphics Society (MGS). Initially much of the technology concentrated either on 
high-performance 3D graphics, including interactive rotation or 3D rendering of atoms as 
spheres (sometimes with radiosity). During the 1980s a number of programs for calculating 
molecular properties (such as -» molecular dynamics and quantum mechanics) became 
available and the term "molecular graphics" often included these. As a result the MGS has 
now changed its name to the Molecular Graphics and Modelling Society (MGMS). 

The requirements of macromolecular crystallography also drove MG because the traditional 
techniques of physical model-building could not scale. Alwyn Jones' FRODO program (and 
later "O") were developed to overlay the molecular electron density determined from X-ray 
crystallography and the hypothetical molecular structure. 
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Art, science and technology in molecular graphics 

Both computer technology and graphic arts have 
contributed to molecular graphics. The development 
of structural biology in the 1950s led to a 
requirement to display molecules with thousands of 
atoms. The existing computer technology was 
limited in power, and in any case a naive depiction 
of all atoms left viewers overwhelmed. Most systems 
therefore used conventions where information was 
implicit or stylistic. Two vectors meeting at a point 
implied an atom or (in macromolecules) a complete 
residue (10-20 atoms). 

The macromolecular approach was popularized by 
Dickerson and Geis' presentation of proteins and the 
graphic work of Jane Richardson through 
high-quality hand-drawn diagrams such as the 
"ribbon" representation. In this they strove to 
capture the intrinsic 'meaning' of the molecule. This 
search for the "messages in the molecule" has 
always accompanied the increasing power of 
computer graphics processing. Typically the 
depiction would concentrate on specific areas of the 
molecule (such as the active site) and this might 
have different colours or more detail in the number 
of explicit atoms or the type of depiction (e.g., 
spheres for atoms). 




Fig. 2. Image of hemagglutinin with alpha 

helices depicted as cylinders and the rest 

of the chain as silver coils. The individual 

protein atoms (several thousand) have 

been hidden. All of the non-hydrogen atoms 

in the two ligands (presumably sialic acid) 

have been shown near the top of the 

diagram. Key: Carbon = grey, oxygen = 

red, nitrogen = blue. 



In some cases the limitations of technology have led 
to serendipitous methods for rendering. Most early graphics devices used vector graphics, 
which meant that rendering spheres and surfaces was impossible. Michael Connolly's 
program "MS" calculated points on the surface-accessible surface of a molecule, and the 
points were rendered as dots with good visibility using the new vector graphics technology, 
such as the Evans and Sutherland PS300 series. Thin sections ("slabs") through the 
structural display showed very clearly the complementarity of the surfaces for molecules 
binding to active sites, and the "Connolly surface" became a universal metaphor. 

The relationship between the art and science of molecular graphics is shown in the 
exhibitions sponsored by the Molecular Graphics Society. Some exhibits are created with 
molecular graphics programs alone, while others are collages, or involve physical materials. 
An example from Mike Hann (1994), inspired by Magritte's painting Ceci n'est pas une 
pipe, uses an image of a salmeterol molecule. 

"Ceci n'est pas une molecule," writes Mike Hann, "serves to remind us that all of the 
graphics images presented here are not molecules, not even pictures of molecules, but 
pictures of icons which we believe represent some aspects of the molecule's properties." 
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Space-filling models 

Fig. 4 is a "space-filling" representation of formic acid, 

where atoms are drawn to suggest the amount of space 

they occupy. This is necessarily an icon: in the quantum 

mechanical representation of molecules, there are only 

(positively charged) nuclei and a "cloud" of negative 

electrons. The electron cloud defines an approximate 

size for the molecule, though there can be no single 

precise definition of size. For many years the size of 

atoms has been approximated by mechanical models 

(CPK), where the atoms have been represented by 

plastic spheres whose radius (van der Waals radius) 

describes a sphere within which "most" of the electron 

density can be found. These spheres could be clicked 

together to show the steric aspects of the molecule 

rather than the positions of the nuclei. Fig. 4 shows the 

intricacy required to make sure that all spheres intersect correctly, and also demonstrates 

a reflective model. 




Fig. 4. Space-filling model of formic 

acid. Key: Hydrogen = white, carbon = 

black, oxygen = red. 
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Fig. 5. A molecule (zirconocene) where 


part (left) is rendered as ball-and-stick 


and part Q 


-ight) as an isosurface. 



Since the atomic radii (e.g. in Fig. 4) are only slightly 
less than the distance between bonded atoms, the 
iconic spheres intersect, and in the CPK models, this 
was achieved by planar truncations along the bonding 
directions, the section being circular. When raster 
graphics became affordable, one of the common 
approaches was to replicate CPK models in silico. It is 
relatively straightforward to calculate the circles of 
intersection, but more complex to represent a model 
with hidden surface removal. A useful side product is 
that a conventional value for the molecular volume can 
be calculated. 



The use of spheres is often for convenience, being 
limited both by graphics libraries and the additional effort required to compute complete 
electronic density or other space-filling quantities. It is now relatively common to see 
images of isosurfaces that have been coloured to show quantities such as electrostatic 
potential. The commonest isosurfaces are the Connolly surface, or the volume within which 
a given proportion of the electron density lies. The isosurface in Fig. 5 appears to show the 
electrostatic potential, with blue colours being negative and red/yellow (near the metal) 
positive. (There is no absolute convention of colouring, and red/positive, blue/negative are 
often confusingly reversed!) Opaque isosurfaces do not allow the atoms to be seen and 
identified and it is not easy to deduce them. Because of this, isosurfaces are often drawn 
with a degree of transparency. 
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Technology 

Molecular graphics has always pushed the limits of display technology, and has seen a 
number of cycles of integration and separation of compute-host and display. Early systems 
like Project MAC were bespoke and unique, but in the 1970s the MMS-X and similar 
systems used (relatively) low-cost terminals, such as the Tektronix 4014 series, often over 
dial-up lines to multi-user hosts. The devices could only display static pictures but, were 
able to evangelize MG. In the late 1970s, it was possible for departments (such as 
crystallography) to afford their own hosts (e.g., PDP-11) and to attach a display (such as 
Evans & Sutherland's MPS) directly to the bus. The display list was kept on the host, and 
interactivity was good since updates were rapidly reflected in the display— at the cost of 
reducing most machines to a single-user system. 

In the early 1980s, Evans & Sutherland (E&S) decoupled their PS300 display, which 
contained its own display information transformable through a dataflow architecture. 
Complex graphical objects could be downloaded over a serial line (e.g. 9600 baud) and then 
manipulated without impact on the host. The architecture was excellent for high 
performance display but very inconvenient for domain-specific calculations, such as 
electron-density fitting and energy calculations. Many crystallographers and modellers 
spent arduous months trying to fit such activities into this architecture. 

The benefits for MG were considerable, but by the later 1980s, UNIX workstations such as 
Sun-3 with raster graphics (initially at a resolution of 256 by 256) had started to appear. 
Computer-assisted drug design in particular required raster graphics for the display of 
computed properties such as atomic charge and electrostatic potential. Although E&S had a 
high-end range of raster graphics (primarily aimed at the aerospace industry) they failed to 
respond to the low-end market challenge where single users, rather than engineering 
departments, bought workstations. As a result the market for MG displays passed to Silicon 
Graphics, coupled with the development of minisupercomputers (e.g., CONVEX and Alliant) 
which were affordable for well-supported MG laboratories. Silicon Graphics provided a 
graphics language, IrisGL, which was easier to use and more productive than the PS300 
architecture. Commercial companies (e.g., Biosym, Polygen/MSI) ported their code to 
Silicon Graphics, and by the early 1990s, this was the "industry standard". 

Stereoscopic displays were developed based on liquid crystal polarized spectacles, and 
while this had been very expensive on the PS300, it now became a commodity item. A 
common alternative was to add a polarizable screen to the front of the display and to 
provide viewers with extremely cheap spectacles with orthogonal polarization for separate 
eyes. With projectors such as Barco, it was possible to project stereoscopic display onto 
special silvered screens and supply an audience of hundreds with spectacles. In this way 
molecular graphics became universally known within large sectors of chemical and 
biochemical science, especially in the pharmaceutical industry. Because the backgrounds of 
many displays were black by default, it was common for modelling sessions and lectures to 
be held with almost all lighting turned off. 

In the last decade almost all of this technology has become commoditized. IrisGL evolved to 
OpenGL so that molecular graphics can be run on any machine. In 1992, Roger Sayle 
released his RasMol program into the public domain. RasMol contained a very 
high-performance molecular renderer that ran on Unix/X Window, and Sayle later ported 
this to the Windows and Macintosh platforms. The Richardsons developed kinemages and 
the Mage software, which was also multi-platform. By specifying the chemical MIME type, 
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molecular models could be served over the Internet, so that for the first time MG could be 
distributed at zero cost regardless of platform. In 1995, Birkbeck College's crystallography 
department used this to run "Principles of Protein Structure", the first multimedia course 
on the Internet, which reached 100 to 200 scientists. 
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Fig. 6. A molecule of Porin (protein) shown without ambient occlusion (left) and with (right). Advanced rendering 
effects can improve the comprehension of the 3D shape of a molecule. 

MG continues to see innovation that balances technology and art, and currently zero-cost or 
open source programs such as PyMOL and Jmol have very wide use and acceptance. 

Recently the wide spread diffusion of advanced graphics hardware, has improved the 
rendering capabilities of the visualization tools. The capabilities of current shading 
languages allow the inclusion of advanced graphic effects (like ambient occlusion, cast 
shadows and non-photorealistic rendering techniques) in the interactive visualization of 
molecules. These graphic effects, beside being eye candy, can improve the comprehension 
of the three dimensional shapes of the molecules. An example of the effects that can be 
achieved exploiting recent graphics hardware can be seen in the simple open source 
visualization system QuteMol. 



Algorithms 



Reference frames 

Drawing molecules requires a transformation between molecular coordinates (usually, but 
not always, in Angstrom units) and the screen. Because many molecules are chiral it is 
essential that the handedness of the system (almost always right-handed) is preserved. In 
molecular graphics the origin (0, 0) is usually at the lower left, while in many computer 
systems the origin is at top left. If the z-coordinate is out of the screen (towards the viewer) 
the molecule will be referred to right-handed axes, while the screen display will be 
left-handed. 

Molecular transformations normally require: 

• scaling of the display (but not the molecule). 

• translations of the molecule and objects on the screen. 

• rotations about points and lines. 

Conformational changes (e.g. rotations about bonds) require rotation of one part of the 
molecule relative to another. The programmer must decide whether a transformation on the 
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screen reflects a change of view or a change in the molecule or its reference frame. 



Simple 




Fig. 7. Stick model of caffeine drawn in 
Jmol. 



In early displays only vectors could be drawn e.g. (Fig. 
7) which are easy to draw because no rendering or 
hidden surface removal is required. 

On vector machines the lines would be smooth but on 
raster devices Bresenham's algorithm is used (note the 
"jaggies" on some of the bonds, which can be largely 
removed with antialiasing software.) 

Atoms can be drawn as circles, but these should be 
sorted so that those with the largest z-coordinates 
(nearest the screen) are drawn last. Although 
imperfect, this often gives a reasonably attractive 
display. Other simple tricks which do not include 
hidden surface algorithms are: 



• colouring each end of a bond with the same colour as the atom to which it is attached 
(Fig. 7). 

• drawing less than the whole length of the bond (e.g. 10%-90%) to simulate the bond 
sticking out of a circle. 

• adding a small offset white circle within the circle for an atom to simulate reflection. 

Typical pseudocode for creating Fig. 7 (to fit the molecule exactly to the screen): 
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Note that this assumes the origin is in the bottom left corner of the screen, with Y up the 
screen. Many graphics systems have the origin at the top left, with Y down the screen. In 
this case the lines (1) and (2) should have the y coordinate generation as: 

yO = yScreenMax - (yOf f set+atom0.getY( )*scale) // (1) 
yl = yScreenMax - (yOf f set+atoml. getY( )*scale) // (2) 

Changes of this sort change the handedness of the axes so it is easy to reverse the chirality 
of the displayed molecule unless care is taken. 

Advanced 

For greater realism and better comprehension of the 3D structure of a molecule many 
computer graphics algorithms can be used. For many years molecular graphics has 
stressed the capabilities of graphics hardware and has required hardware-specific 
approaches. With the increasing power of machines on the desktop, portability is more 
important and programs such as Jmol have advanced algorithms that do not rely on 
hardware. On the other hand recent graphics hardware is able to interactively render very 
complex molecule shapes with a quality that would not be possible with standard software 
techniques. 



Chronology 



This table provides an incomplete chronology of molecular graphics advances. 


Developer(s) 


Approximate 
date 


Technology 


Comments 


Crystallographers 


< 1960 


Hand-drawn 


Crystal structures, with hidden atom 
and bond removal. Often 
clinographic projections. 


Cyrus Levinthal, Bob 
Langridge 


1960s 


CRT 


First protein display on screen 
(Project MAC). 


Johnson, Motherwell 


ca 1970 


Pen plotter 


ORTEP, PLUTO. Very widely 
deployed for publishing crystal 
structures. 


Langridge, White, 
Marshall 


Late 1970s 


Departmental systems 
(PDP-11, Tektronix displays 
or DEC-VT11, e.g. MMS-X) 


Mixture of commodity computing 
with early displays. 


T. Alwyn Jones 


1978 


FRODO 


Crystallographic structure solution. 


Davies, Hubbard 


Mid-1980s 


CHEM-X, HYDRA 


Laboratory systems with multicolor, 
raster and vector devices (Sigmex, 
PS300). 


Biosym, Tripos, Polygen 


Mid-1980s 


PS300 and lower cost dumb 
terminals (VT200, SIGMEX) 


Commercial integrated modelling 
and display packages. 


Silicon Graphics, Sun 


Late 1980s 


IRIS GL (UNIX) workstations 


Commodity-priced single-user 
workstations with stereoscopic 
display. 


EMBL - WHAT IF [4] 


1989, 2000 


Machine independent 


Nearly free, multifunctional, still 
fully supported, many free servers 
based on it 


Sayle, Richardson 


1992, 1993 


RasMol, Kinemage 


Platform-independent MG. 
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MDL (van Vliet, Maffett, 
Adler, Holt) 


1995-1998 


Chime 


proprietary C++ ; free browser 
plugin for Mac (OS9) and PCs 


ChemAxon 


1998- 


MarvinSketch [6] & 
MarvinView 
MarvinSpace [8] (2005) 


proprietary Java applet or 
stand-alone application. 


Community efforts 


2000- 


Jmol, PyMol, Protein 
Workshop (www.pdb.org) 


Open-source Java applet or 
stand-alone application. 


San Diego 
Supercomputer Center 


2006- 


Sirius 


Free for academic/non-profit 
institutions 


NOCH 


2002- 


T91 
NOC L J 


Powerful and open source code 
molecular structure explorer 


Weizmann Institute of 
Science - Community 
efforts 


2008- 


Proteopedia 


Collaborative, 3D wiki encyclopedia 
of proteins & other molecules 
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See also 

• Molecular Design software 

• Molecular model 

• -» Molecular modelling 

• Molecular geometry 

• Software for molecular mechanics modeling 

External links 



• The PyMOL Molecular Graphics System (http://pymol.sf.net) -- open source 

• PyMOLWiki (http://pymolwiki.org) -- community supported wiki for PyMOL 

• History of Visualization of Biological Macromolecules (http://www.umass.edu/ 
microbio/rasmol/history.htm) by Eric Martz and Eric Francoeur. 

• Brief History of Molecular Mechanics/Graphics (http://stanley.chem.lsu.edu/webpub/ 
7770-l_ecture-l-intro.pdf) in LSU CHEM7770 lecture notes. 

• Historical slides (http://luminary.stanford.edu/langridge/slides.htm) from Robert 
(Bob) Langridge. These show the influence of Crick and Watson on molecular graphics 
(including Levinthal's) and the development of early display technology, finishing with 
displays which were common in the mid-1980s on machines such as Evans and 
Sutherland's PS300 series. 
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Interview with Langridge. (http://luminary.stanford.edu/langridge/langridge.html) The 

display looking down the axis of B-DNA has been likened to a rose window. 

Nelson Max's home page (http://accad.osu.edu/~waynec/history/tree/max.html) with 

links to 1982 classics. 

Jmol home page (http://jmol.sourceforge.net/) contains an applet with an automatic 

display of many features of molecular graphics including metaphors, scripting, 

annotation and animation. 

Richardson Lab (http://kinemage.biochem.duke.edu/) includes Kinemage and 

molecular graphics images. 

History of RasMol. (http://www.openrasmol.org/history.html) 

Molecule of the Month (http://www. rcsb.org/pdb/static. do?p=education_discussion/ 

molecule_of_the_month/index.html) at RCSB/PDB. 

xeo (http://sourceforge.net/projects/xeo) xeo is a free (GPL) open project management 

for nanostructures using Java 

Exhibitions of Molecular Graphics Art (http://www.scripps.edu/mb/goodsell/mgs_art/ 

), 1994, 1998. 

NOCH home page (http://noch.sourceforge.net) A powerful, efficient and open source 

molecular graphics tool. 

eMovie (http://www.weizmann.ac.il/ISPC/eMovie.html): a tool for creation of 

molecular animations with PyMOL. 

Proteopedia (http://www.proteopedia.org): The collaborative, 3D encyclopedia of 

proteins and other molecules. 

Ascalaph Graphics (http://www.agilemolecule.com/Ascalaph/Ascalaph_Graphics.html): 

a molecular viewer with some geometry editing capabilities. 

Molecular Graphics and Modelling Society, (http://www.mgms.org/) 

Journal of Molecular Graphics and Modelling (http://www.sciencedirect.com/ 

science?_ob=JournalURL&_cdi=5260&_auth=y&_acct=C000053194&_version=l& 

_urlVersion=0&_userid = 1495569&md5 = le86bcce088e98890cea52f6eda84b64) 

(formally Journal of Molecular Graphics). This journal is not open access. 



Complex dynamics 96 



Complex Systems Dynamics 



Complex dynamics 



Complex dynamics the study of dynamical systems for which the phase space is a complex 
manifold. Complex analytic dynamics specifies more precisely that it is analytic functions 
whose dynamics it is to study. 

Technics 111 

• Montel's theorem 

• Poincare metric 

• Schwarz lemma 

• Riemann mapping theorem 

• -» Caratheodory's theorem (conformal mapping) 

See also 

complex analysis 

Orbit portrait 

John Milnor 

Complex quadratic polynomial 

Fatou set 

Julia set 

Mandelbrot set 
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Topological dynamics 

In mathematics, topological dynamics is a branch of the theory of dynamical systems in 
which qualitative, asymptotic properties of dynamical systems are studied from the 
viewpoint of general topology. 

Scope 

The central object of study in topological dynamics is a topological dynamical system, 

i.e. a topological space, together with a continuous transformation, a continuous flow, or 
more generally, a semigroup of continuous transformations of that space. The origins of 
topological dynamics lie in the study of asymptotical properties of trajectories of systems of 
autonomous ordinary differential equations, in particular, the behavior of limit sets and 
various manifestations of "repetetiveness" of the motion, such as periodic trajectories, 
recurrence and minimality, stability, non-wandering points. George Birkhoff is considered 
to be the founder of the field. A structure theorem for minimal distal flows proved by Hillel 
Furstenberg in the early 1960s inspired much work on classification of minimal flows. A lot 
of research in the 1970s and 1980s was devoted to topological dynamics of one-dimensional 
maps, in particular, piecewise linear self-maps of the interval and the circle. 

Unlike the theory of smooth dynamical systems, where the main object of study is a smooth 
manifold with a diffeomorphism or a smooth flow, phase spaces considered in topological 
dynamics are general metric spaces (usually, compact). This necessitates development of 
entirely different techniques but allows extra degree of flexibility even in the smooth 
setting, because invariant subsets of a manifold are frequently very complicated 
topologically (cf limit cycle, strange attractor); additionally, shift spaces arising via 
symbolic representations can be considered on an equal footing with more geometric 
actions. Topological dynamics has intimate connections with ergodic theory of dynamical 
systems, and many fundamental concepts of the latter have topological analogues (cf 
Kolmogorov-Sinai entropy and topological entropy). 

See also 

• Poincare-Bendixson theorem 

• Symbolic dynamics 

• -» Topological conjugacy 
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Topological conjugacy 



In mathematics, two functions are said to be topologically conjugate to one another if 
there exists a homeomorphism that will conjugate the one into the other. Topological 
conjugacy is important in the study of iterated functions and more generally dynamical 
systems, since, if the dynamics of one iterated function can be solved, then those for any 
topologically conjugate function follow trivially. 

To illustrate this directly: suppose that f and g are iterated functions, and there exists an h 
such that 

g = h~ o f o h, 
so that /and g are topologically conjugate. Then of course one must have 

g n = h- 1 of n oh 7 

and so the iterated systems are conjugate as well. Here, ° denotes function composition. 

As examples, the logistic map and the tent map are topologically conjugate. Furthermore, 
the logisitic map of unit height and the Bernoulli map are topologically conjugate. 

Definition 

Let X and Y be topological spaces, and let f- X ^> X and 9- Y — *■ Y be continuous 
functions. We say that / is topologically semiconjugate to 9 , if there exists a 
continuous surjection h:Y —* X such that f°h = hog,\fh is a homeomorphism, then 
we say that / and 9 are topologically conjugate, and we call h a topological 
conjugation between / and 9 . 

Similarly, a flow *P on X is topologically semiconjugate to a flow i/' on Y if there is a 
continuous surjection h: Y -» X such that <p(h(y),t) = h4>(y,t) for each y £ Y, t £ M . If 
h is a homeomorphism then V 1 and V are topologically conjugate. 

Discussion 

Topological conjugation defines an equivalence relation in the space of all continuous 
surjections of a topological space to itself, by declaring / and 9 to be related if they are 
topologically conjugate. This equivalence relation is very useful in the theory of dynamical 
systems, since each class contains all functions which share the same dynamics from the 
topological viewpoint. For example, orbits of 9 are mapped to homeomorphic orbits of / 
through the conjugation. Writing g = h~ o f oh makes this fact evident: 
g n = h~ o /" o h . Speaking informally, topological conjugation is a "change of 
coordinates" in the topological sense. 

However, the analogous definition for flows is somewhat restrictive. In fact, we are 
requiring the maps ^(-,*) and V'('iO to be topologically conjugate for each t, which is 
requiring more than simply that orbits of ( r be mapped to orbits of V homeomorphically. 
This motivates the definition of topological eguivalence, which also partitions the set of 
all flows in X into classes of flows sharing the same dynamics, again from the topological 



Topological conjugacy 99 

viewpoint. 

Topological equivalence 

We say that V' and P are topologically equivalent, if there is a homeomorphism 
h:Y—>X, mapping orbits of V-' to orbits of ( P homeomorphically, and preserving 
orientation of the orbits. In other words, letting O denote an orbit, one has 
h(0(y,i>)) = {htyfat)) :teR} = M%),f) :teR} = 0(h(y),<p) 

for each y £ Y . in addition, one must line up the flow of time: for each y £ Y t there exists 
a 6 > such that, if < |s| < t < S t and if s is such that <p{h(y),s) = h(ii^y,t)), then 
5 > 0. 

Overall, topological eguivalence is a weaker equivalence criterion than topological 
conjugacy, as it does not require that the time term is mapped along with the orbits and 
their orientation. An example of a topologically equivalent but not topologically conjugate 
system would be the non-hyperbolic class of two dimensional systems of differential 
equations that have closed orbits. While the orbits can be transformed each other to 
overlap in the spatial sense, the periods of such systems cannot be analogously matched, 
thus failing to satisfy the topological conjugacy criterion while satisfying the topological 
equivalence criterion. 

Generalizations of dynamic topological conjugacy 

There are two reported extensions of the concept of dynamic topological conjugacy: 

1 . Analogous systems defined as isomorphic dynamical systems 

2. Adjoint dynamical systems defined via adjoint functors and natural equivalences in 
categorical dynamics . 

Cited References 

[1] http://planetphysics.org/encyclopedia/Complexity.html Complexity and Categorical Dynamics 
[2] http://planetphysics.org/encyclopedia/AnalogousSystems3.html Analogous systems, Topological Conjugacy 
and Adjoint Systems 

See also 

• Commutative diagram 

This article incorporates material from topological conjugation on PlanetMath, which is 
licensed under the GFDL. 
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Network dynamics 

Network dynamics is the study of networks that change in time. These networks could be 
from the fields of biology,sociology,economics, computer science, graph theory etc. 

For a dynamical systems approach to network dynamics, see sequential dynamical system. 

See also 

-» Dynamic network analysis 
-» Metastability in the brain 
Gaussian network model 
Neural network 
Cellular neural network 
Small-world network 
Network planning and design 
Dynamic Bayesian network 
Dynamic single-frequency networks 
Biological network inference 
Technology Dynamics 
Source-sink dynamics 
Social network analysis software 

Dynamic network analysis 

Dynamic network analysis (DNA) is an emergent scientific field that brings together 
traditional social network analysis (SNA), link analysis (LA) and multi-agent systems (MAS). 
There are two aspects of this field. The first is the statistical analysis of DNA data. The 
second is the utilization of simulation to address issues of network dynamics. DNA networks 
vary from traditional social networks in that they are larger, dynamic, multi-mode, 
multi-plex networks, and may contain varying levels of uncertainty. 

DNA statistical tools are generally optimized for large-scale networks and admit the 
analysis of multiple networks simultaneously in which, there are multiple types of nodes 
(multi-node) and multiple types of links (multi-plex). In contrast, SNA statistical tools focus 
on single or at most two mode data and facilitate the analysis of only one type of link at a 
time. 

DNA statistical tools tend to provide more measures to the user, because they have 
measures that use data drawn from multiple networks simultaneously. From a computer 
simulation perspective, nodes in DNA are like atoms in quantum theory, nodes can be, 
though need not be, treated as probabilistic. Whereas nodes in a traditional SNA model are 
static, nodes in a DNA model have the ability to learn. Properties change over time; nodes 
can adapt: A company's employees can learn new skills and increase their value to the 
network; Or, capture one terrorist and three more are forced to improvise. Change 
propagates from one node to the next and so on. DNA adds the element of a network's 
evolution and considers the circumstances under which change is likely to occur. 
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An example of a multi-entity, multi-network, dynamic network diagram 



Illustrative 
problems that 
people in the DNA 
area work on 

• Developing metrics and 
statistics to assess and 
identify change within and 
across networks. 

• Developing and validating 
simulations to study 
network change, evolution, 
adaptation, decay... See 

Computer simulation and organizational studies 

• Developing and validating formal models of network generation and evolution 

• Developing and testing theory of network change, evolution, adaptation, decay... 

• Developing techniques to visualize network change overall or at the node or group level 

• Developing statistical techniques to see whether differences observed over time in 
networks are due to simply different samples from a distribution of links and nodes or 
changes over time in the underlying distribution of links and nodes 

• Developing control processes for networks over time 

• Developing algorithms to change distributions of links in networks over time 

• Developing algorithms to track groups in networks over time. 

• Developing tools to extract or locate networks from various data sources such as texts. 

• Developing statistically valid measurements on networks over time. 

• Examining the robustness of network metrics under various types of missing data 

• Empirical studies of multi-mode multi-link multi-time period networks 

• Examining networks as probabilistic time-variant phenomena 

• Forecasting change in existing networks 

• Identifying trails through time given a sequence of networks. 

• Identifying changes in node criticality given a sequence of networks anything else related 
to multi-mode multi-link multi-time period networks. 

Kathleen Carley, of Carnegie Mellon University, is the leading authority in this field. 
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Further reading 

• Kathleen M. Carley, 2003, "Dynamic Network Analysis" in Dynamic Social Network 
Modeling and Analysis: Workshop Summary and Papers, Ronald Breiger, Kathleen 
Carley, and Philippa Pattison, (Eds.) Committee on Human Factors, National Research 
Council, National Research Council. Pp. 133-145, Washington, DC. 

• Kathleen M. Carley, 2002, "Smart Agents and Organizations of the Future" The 
Handbook of New Media. Edited by Leah Lievrouw and Sonia Livingstone, Ch. 12, pp. 
206-220, Thousand Oaks, CA, Sage. 

• Kathleen M. Carley, Jana Diesner, Jeffrey Reminga, Maksim Tsvetovat, 2008, Toward an 
Interoperable Dynamic Network Analysis Toolkit, DSS Special Issue on 
Cyberinfrastructure for Homeland Security: Advances in Information Sharing, Data 
Mining, and Collaboration Systems. Decision Support Systems 43(4):1324-1347 
(article 20 [2] ) 

See also 

• -» Network dynamics 

• Sequential dynamical system 

• Kathleen Carley 

• Network science 

• INSNA 

External links 

• Radcliffe Exploratory Seminar on Dynamic Networks 

• Center for Computational Analysis of Social and Organizational Systems (CASOS) 
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Theoretical biology 

Theoretical biology is a field of academic study and research that involves the use of 
models and theories in biology. 

Many separate areas of biology fall under the concept of theoretical biology, according to 
the way they are studied. Some of these areas include: animal behaviour (ethology), 
biomechanics, biorhythms, cell biology, complexity of biological systems, ecology, enzyme 
kinetics, evolutionary biology, genetics, immunology, membrane transport, microbiology, 
molecular structures, morphogenesis, physiological mechanisms, -» systems biology and the 
origin of life. Neurobiology is an example of a subdiscipline of biology which already has a 
theoretical version of its own, theoretical or computational neuroscience. 

The ultimate goal of the theoretical biologist is to explain the biological world using mainly 
mathematical and computational tools. Though it is ultimately based on observations and 
experimental results, the theoretical biologist's product is a model or theory, and it is this 
that chiefly distinguishes the theoretical biologist from other biologists. 

Theoretical biologists 

Pere Alberch 
Anthony F. Bartholomay 
Ludwig von Bertalanffy 
J. T. Bonner 
Jack Cowan 
Francis Crick 
Gerd B. Miiller 
Walter M. Elsasser 
Claus Emmeche 
Andree Ehresmann 
Marc Feldman 
Ronald A. Fisher 
Brian Goodwin 
Bryan Grenfell 
J. B. S. Haldane 
William D. Hamilton 
Lionel G. Harrison 
Michael Hassell 
Sven Erik Jorgensen 
George Karreman 
Stuart Kauffman 
Kalevi Kull 
Herbert D. Landahl 
Richard Lewontin 
Humberto Maturana 
Robert May 
John Maynard Smith 
James D. Murray 
Howard Pattee 
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George R. Price 

Erik Rauch 

Nicolas Rashevsky 

Ronald Brown (mathematician) 

Johannes Reinke 

Robert Rosen 

Peter Schuster 

Rene Thorn 

D'Arcy Thompson 

Jakob von Uexkiill 

Robert Ulanowicz 

Francisco Varela 

C. H. Waddington 

Arthur Winfree 

Lewis Wolpert 

Sewall Wright 

Christopher Zeeman 

See also 

• Journal of Theoretical Biology 

• Bioinformatics 

• Biosemiotics 

• -» Mathematical biology 

• Theoretical ecology 

• Artificial life 
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External links 

• Theory of Biological Anthropology (Documents No. 9 and 10 in English) 

• Drawing the Line Between Theoretical and Basic Biology (a forum article by Isidro T. 
Savillo) [2] 

Related Journals 

Acta Biotheoretica 

Bioinformatics 

Biological Theory 

BioSystems 

Bulletin of Mathematical Biology 

roi 

Ecological Modelling 

Journal of Mathematical Biology 

Journal of Theoretical Biology 

Journal of the Royal Society Interface 

ri2i 
Mathematical Biosciences 

Medical Hypotheses [13] 

Rivista di Biologia-Biology Forum 

Theoretical and Applied Genetics 

Theoretical Biology and Medical Modelling 

Theoretical Population Biology 

ri8i 
Theory in Biosciences (formerly: Biologisches Zentralblatt) 

Related societies 

American Mathematical Society 

British Society of Developmental Biology ' ' 

roil 

European Mathematical Society 

ESMTB: European Society for Mathematical and Theoretical Biology 

["231 

The International Biometric Society 

International Society for Ecological Modelling ' ' 

The Israeli Society for Theoretical and Mathematical Biology 

London Mathematical Society 

Societe Francophone de Biologie Theorique 

Society for Industrial and Applied Mathematics ' ' 

Society for Mathematical Biology 

International Society for Biosemiotic Studies *■ ' 



[16] 
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Mathematical biology 

Mathematical biology is also called theoretical biology/ and sometimes 
biomathematics. It includes at least four major subfields: biological mathematical 
modeling, relational biology/complex systems biology (CSB), bioinformatics and 
computational biomodeling/biocomputing. It is an interdisciplinary academic research field 
with a wide range of applications in biology, medicine and biotechnology. 

Mathematical biology aims at the mathematical representation, treatment and modeling of 
biological processes, using a variety of applied mathematical techniques and tools. It has 
both theoretical and practical applications in biological, biomedical and biotechnology 
research. For example, in cell biology, protein interactions are often represented as 
"cartoon" models, which, although easy to visualize, do not accurately describe the systems 
studied. In order to do this, precise mathematical models are required. By describing the 
systems in a quantitative manner, their behavior can be better simulated, and hence 
properties can be predicted that might not be evident to the experimenter. 

Importance 

Applying mathematics to biology has a long history, but only recently has there been an 
explosion of interest in the field. Some reasons for this include: 

• the explosion of data-rich information sets, due to the genomics revolution, which are 
difficult to understand without the use of analytical tools, 

• recent development of mathematical tools such as chaos theory to help understand 
complex, nonlinear mechanisms in biology, 

• an increase in computing power which enables calculations and simulations to be 
performed that were not previously possible, and 

• an increasing interest in in silico experimentation due to ethical considerations, risk, 
unreliability and other complications involved in human and animal research. 

For use of basic arithmetics in biology, see relevant topic, such as Serial dilution. 

Areas of research 

Several areas of specialized research in mathematical and theoretical biology 

rq] 

as well as external links to related projects in various universities are concisely 
presented in the following subsections, including also a large number of appropriate 
validating references from a list of several thousands of published authors contributing to 
this field. Many of the included examples are characterised by highly complex, nonlinear, 
and supercomplex mechanisms, as it is being increasingly recognised that the result of such 
interactions may only be understood through a combination of mathematical, logical, 
physical/chemical, molecular and computational models. Due to the wide diversity of 
specific knowledge involved, biomathematical research is often done in collaboration 
between mathematicians, biomathematicians, theoretical biologists, physicists, 
biophysicists, biochemists, bioengineers, engineers, biologists, physiologists, research 
physicians, biomedical researchers, oncologists, molecular biologists, geneticists, 
embryologists, zoologists, chemists, etc. 
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Computer models and automata theory 

A monograph on this topic summarizes an extensive amount of published research in this 
area up to 1987, including subsections in the following areas: computer modeling in 
biology and medicine, arterial system models, neuron models, biochemical and oscillation 
networks, quantum automata, quantum computers in molecular biology and genetics, 

cancer modelling, neural nets, genetic networks, abstract relational biology, 

ri2i ri3i 

metabolic-replication systems, category theory applications in biology and medicine, 

automata theory,cellular automata, tessallation models and complete 

self-reproduction , chaotic systems in organisms, relational biology and organismic 

theories. This published report also includes 390 references to peer-reviewed 

articles by a large number of authors. 

Modeling cell and molecular biology 

[221 

This area has received a boost due to the growing importance of molecular biology. 

T231 

Mechanics of biological tissues 

Theoretical enzymology and enzyme kinetics 

Cancer modelling and simulation 

Modelling the movement of interacting cell populations 

T271 

Mathematical modelling of scar tissue formation 
Mathematical modelling of intracellular dynamics 

T291 

Mathematical modelling of the cell cycle 



Modelling physiological systems 

• Modelling of arterial disease 

T311 

• Multi-scale modelling of the heart 



Molecular set theory- 
Molecular set theory was introduced by Anthony Bartholomay, and its applications were 
developed in mathematical biology and especially in Mathematical Medicine. Molecular 
set theory (MST) is a mathematical formulation of the wide-sense chemical kinetics of 
biomolecular reactions in terms of sets of molecules and their chemical transformations 
represented by set-theoretical mappings between molecular sets. In a more general sense, 
MST is the theory of molecular categories defined as categories of molecular sets and their 
chemical transformations represented as set-theoretical mappings of molecular sets. The 
theory has also contributed to biostatistics and the formulation of clinical biochemistry 
problems in mathematical formulations of pathological, biochemical changes of interest to 
Physiology, Clinical Biochemistry and Medicine. 

Population dynamics 

Population dynamics has traditionally been the dominant field of mathematical biology. 
Work in this area dates back to the 19th century. The Lotka-Volterra predator-prey 
equations are a famous example. In the past 30 years, population dynamics has been 
complemented by evolutionary game theory, developed first by John Maynard Smith. Under 
these dynamics, evolutionary biology concepts may take a deterministic mathematical form. 
Population dynamics overlap with another active area of research in mathematical biology: 
mathematical epidemiology, the study of infectious disease affecting populations. Various 
models of viral spread have been proposed and analyzed, and provide important results that 
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may be applied to health policy decisions. 

Mathematical methods 

A model of a biological system is converted into a system of equations, although the word 
'model' is often used synonymously with the system of corresponding equations. The 
solution of the equations, by either analytical or numerical means, describes how the 
biological system behaves either over time or at equilibrium. There are many different 
types of equations and the type of behavior that can occur is dependent on both the model 
and the equations used. The model often makes assumptions about the system. The 
equations may also make assumptions about the nature of what may occur. 

Mathematical biophysics 

The earlier stages of mathematical biology were dominated by mathematical biophysics, 
described as the application of mathematics in biophysics, often involving specific 
physical/mathematical models of biosystems and their components or compartments. 

The following is a list of mathematical descriptions and their assumptions. 

Deterministic processes (dynamical systems) 

A fixed mapping between an initial state and a final state. Starting from an initial condition 
and moving forward in time, a deterministic process will always generate the same 
trajectory and no two trajectories cross in state space. 

• Difference equations - discrete time, continuous state space. 

• Ordinary differential equations - continuous time, continuous state space, no spatial 
derivatives. See also: Numerical ordinary differential equations. 

• Partial differential equations - continuous time, continuous state space, spatial 
derivatives. See also: Numerical partial differential equations. 

• Maps - discrete time, continuous state space. 

Stochastic processes (random dynamical systems) 

A random mapping between an initial state and a final state, making the state of the system 
a random variable with a corresponding probability distribution. 

• Non-Markovian processes - generalized master equation - continuous time with memory 
of past events, discrete state space, waiting times of events (or transitions between 
states) discretely occur and have a generalized probability distribution. 

• Jump Markov process - master equation - continuous time with no memory of past 
events, discrete state space, waiting times between events discretely occur and are 
exponentially distributed. See also: -» Monte Carlo method for numerical simulation 
methods, specifically continuous-time Monte Carlo which is also called kinetic Monte 
Carlo or the stochastic simulation algorithm. 

• Continuous Markov process - stochastic differential equations or a Fokker-Planck 
equation - continuous time, continuous state space, events occur continuously according 
to a random Wiener process. 

Spatial modelling 

One classic work in this area is Alan Turing's paper on morphogenesis entitled The 
Chemical Basis of Morphogenesis, published in 1952 in the Philosophical Transactions of 
the Royal Society. 

• Travelling waves in a wound-healing assay 
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• Swarming behaviour ] 
A mechanochemical thee 
Biological pattern formation^ ] 



A mechanochemical theory of morphogenesis 



T391 

• Spatial distribution modeling using plot samples 

Phylogenetics 

Phylogenetics is an area of mathematical biology that deals with the reconstruction and 
analysis of phylogenetic (evolutionary) trees and networks based on inherited 
characteristics. The main mathematical concepts are trees, X-trees and maximum 
parsimony trees. 

Model example: the cell cycle 

The eukaryotic cell cycle is very complex and is one of the most studied topics, since its 
misregulation leads to cancers. It is possibly a good example of a mathematical model as it 
deals with simple calculus but gives valid results. Two research groups have 

produced several models of the cell cycle simulating several organisms. They have recently 
produced a generic eukaryotic cell cycle model which can represent a particular eukaryote 
depending on the values of the parameters, demonstrating that the idiosyncrasies of the 
individual cell cycles are due to different protein concentrations and affinities, while the 
underlying mechanisms are conserved (Csikasz-Nagy et al., 2006). 

By means of a system of ordinary differential equations these models show the change in 
time (dynamical system) of the protein inside a single typical cell; this type of model is 
called a deterministic process (whereas a model describing a statistical distribution of 
protein concentrations in a population of cells is called a stochastic process). 
To obtain these equations an iterative series of steps must be done: first the several models 
and observations are combined to form a consensus diagram and the appropriate kinetic 
laws are chosen to write the differential equations, such as rate kinetics for stoichiometric 
reactions, Michaelis-Menten kinetics for enzyme substrate reactions and 
Goldbeter-Koshland kinetics for ultrasensitive transcription factors, afterwards the 
parameters of the equations (rate constants, enzyme efficiency coefficients and Michealis 
constants) must be fitted to match observations; when they cannot be fitted the kinetic 
equation is revised and when that is not possible the wiring diagram is modified. The 
parameters are fitted and validated using observations of both wild type and mutants, such 
as protein half-life and cell size. 

In order to fit the parameters the differential equations need to be studied. This can be 
done either by simulation or by analysis. 

In a simulation, given a starting vector (list of the values of the variables), the progression 
of the system is calculated by solving the equations at each time-frame in small increments. 
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In analysis, the proprieties of 

the equations are used to 

investigate the behavior of the 

system depending of the 

values of the parameters and 

variables. A system of 

differential equations can be 

represented as a vector field, 

where each vector described 

the change (in concentration 

of two or more protein) 

determining where and how 

fast the trajectory (simulation) is heading. Vector fields can have several special points: a 

stable point, called a sink, that attracts in all directions (forcing the concentrations to be at 

a certain value), an unstable point, either a source or a saddle point which repels (forcing 

the concentrations to change away from a certain value), and a limit cycle, a closed 

trajectory towards which several trajectories spiral towards (making the concentrations 

oscillate). 

A better representation which can handle the large number of variables and parameters is 

called a bifurcation diagram(Bifurcation theory): the presence of these special steady-state 

points at certain values of a parameter (e.g. mass) is represented by a point and once the 

parameter passes a certain value, a qualitative change occurs, called a bifurcation, in which 

the nature of the space changes, with profound consequences for the protein 

concentrations: the cell cycle has phases (partially corresponding to Gl and G2) in which 

mass, via a stable point, controls cyclin levels, and phases (S and M phases) in which the 

concentrations change independently, but once the phase has changed at a bifurcation 

event (Cell cycle checkpoint), the system cannot go back to the previous levels since at the 

current mass the vector field is profoundly different and the mass cannot be reversed back 

through the bifurcation event, making a checkpoint irreversible. In particular the S and M 

checkpoints are regulated by means of special bifurcations called a Hopf bifurcation and an 

infinite period bifurcation. 



Mathematical/theoretical biologists 

Pere Alberch 
Anthony F. Bartholomay 
J. T. Bonner 
Jack Cowan 
Gerd B. Miiller 
Walter M. Elsasser 
Claus Emmeche 
Andree Ehresmann 
Marc Feldman 
Ronald A. Fisher 
Brian Goodwin 
Bryan Grenfell 
J. B. S. Haldane 
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William D. Hamilton 

Lionel G. Harrison 

Michael Hassell 

Sven Erik Jorgensen 

George Karreman 

Stuart Kauffman 

Kalevi Kull 

Herbert D. Landahl 

Richard Lewontin 

Humberto Maturana 

Robert May 

John Maynard Smith 

Howard Pattee 

George R. Price 

Erik Rauch 

Nicolas Rashevsky 

Ronald Brown (mathematician) 

Johannes Reinke 

Robert Rosen 

Rene Thorn 

Jakob von Uexkiill 

Robert Ulanowicz 

Francisco Varela 

C. H. Waddington 

Arthur Winfree 

Lewis Wolpert 

Sewall Wright 

Christopher Zeeman 

Mathematical, theoretical and computational biophysicists 

Nicolas Rashevsky 
Ludwig von Bertalanffy 
Francis Crick 
Manfred Eigen 
Walter Elsasser 
Herbert Frohlich, FRS 
Francois Jacob 
Martin Karplus 
George Karreman 
Herbert D. Landahl 
Ilya, Viscount Prigogine 
Sirjohn Randall 
James D. Murray 
Bernard Pullman 
Alberte Pullman 
Erwin Schrodinger 
Klaus Schulten 
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• Peter Schuster 

• Zeno Simon 

• D'Arcy Thompson 

• Murray Gell-Mann 

See also 

Abstract relational biology [42][43] [44] 

Biocybernetics 

Bioinformatics 

Biologically-inspired computing 

Biostatistics 

Cellular automata 1 

Coalescent theory 

-» Complex systems biology 

Computational biology 

Dynamical systems in biology [49] [50] [51] [52] [53] [54] 

Epidemiology 

Evolution theories and Population Genetics 

• Population genetics models 

• Molecular evolution theories 
Ewens's sampling formula 
Excitable medium 
-» Mathematical models 

• -» Molecular modelling 

• Software for molecular modeling 

• Metabolic-replication systems 

• Models of Growth and Form 

• Neighbour-sensing model 
Morphometries 

Organismic systems (OS) [57][58] 
Organismic supercategories 
Population dynamics of fisheries 
Protein folding, also blue Gene and folding@home 
Quantum computers 
Quantum genetics 
Relational biology 

Self-reproduction (also called self-replication in a more general context). 
Computational gene models 
-> Systems biology 
-» Theoretical biology 
Topological models of morphogenesis 

• DNA topology 

• DNA sequencing theory 

For use of basic arithmetics in biology, see relevant topic, such as Serial dilution. 
Biographies 

• Charles Darwin 
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D'Arcy Thompson 
Joseph Fourier 
Charles S. Peskin 
Nicolas Rashevsky 
Robert Rosen 
Rosalind Franklin 
Francis Crick 
Rene Thorn 
Vito Volterra 
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Systems biology is a 

biology-based inter-disciplinary 

study field that focuses on the 

systematic study of complex 

interactions in biological 

systems, thus using a new 

perspective (holism instead of 

reduction) to study them. 

Particularly from year 2000 

onwards, the term is used 

widely in the biosciences, and 

in a variety of contexts. 

Because the scientific method 

has been used primarily toward 

reductionism, one of the goals 

of systems biology is to discover new emergent properties that may arise from the systemic 

view used by this discipline in order to understand better the entirety of processes that 

happen in a biological system. 




Overview 

Systems biology can be considered from a number of different aspects: 

• Some sources discuss systems biology as a field of study, particularly, the study of the 
interactions between the components of biological systems, and how these interactions 
give rise to the function and behavior of that system (for example, the enzymes and 
metabolites in a metabolic pathway). 

• Other sources consider systems biology as a paradigm, usually defined in antithesis to 
the so-called reductionist paradigm, although fully consistent with the scientific method. 
The distinction between the two paradigms is referred to in these quotations: 

"The reductionist approach has successfully identified most of the components and 
many of the interactions but, unfortunately, offers no convincing concepts or methods 
to understand how system properties emerge. ..the pluralism of causes and effects in 
biological networks is better addressed by observing, through quantitative measures, 
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multiple components simultaneously and by rigorous data integration with 
mathematical models" Science 

"Systems biology... is about putting together rather than taking apart, integration 
rather than reduction. It requires that we develop ways of thinking about integration 
that are as rigorous as our reductionist programmes, but different.... It means changing 
our philosophy, in the full sense of the term" Denis Noble 

• Still other sources view systems biology in terms of the operational protocols used for 
performing research, namely a cycle composed of theory, analytic or computational 
modelling to propose specific testable hypotheses about a biological system, 
experimental validation, and then using the newly acquired quantitative description of 
cells or cell processes to refine the computational model or theory. Since the 
objective is a model of the interactions in a system, the experimental techniques that 
most suit systems biology are those that are system-wide and attempt to be as complete 
as possible. Therefore, transcriptomics, metabolomics, proteomics and high-throughput 
techniques are used to collect quantitative data for the construction and validation of 
models. 

• Engineers consider systems biology as the application of dynamical systems theory to 
molecular biology. 

• Finally, some sources see it as a socioscientific phenomenon defined by the strategy of 
pursuing integration of complex data about the interactions in biological systems from 
diverse experimental sources using interdisciplinary tools and personnel. 

This variety of viewpoints is illustrative of the fact that systems biology refers to a cluster of 
peripherally overlapping concepts rather than a single well-delineated field. However the 
term has widespread currency and popularity as of 2007, with chairs and institutes of 
systems biology proliferating worldwide (Such as the Institute for Systems Biology). 

History 

Systems biology finds its roots in: 

• the quantitative modelling of enzyme kinetics, a discipline that flourished between 1900 
and 1970, 

• the simulations developed to study neurophysiology, and 

• control theory and cybernetics. 

One of the theorists who can be seen as a precursor of systems biology is Ludwig von 
Bertalanffy with his general systems theory. One of the first numerical simulations in 
biology was published in 1952 by the British neurophysiologists and Nobel prize winners 
Alan Lloyd Hodgkin and Andrew Fielding Huxley, who constructed a mathematical model 
that explained the action potential propagating along the axon of a neuronal cell. Their 
model described a cellular function emerging from the interaction between two different 
molecular components, a potassium and a sodium channels, and can therefore be seen as 

rpi 

the beginning of computational systems biology. In 1960, Denis Noble developed the first 

rq] 

computer model of the heart pacemaker. J 

The formal study of systems biology, as a distinct discipline, was launched by systems 
theorist Mihajlo Mesarovic in 1966 with an international symposium at the Case Institute of 
Technology in Cleveland, Ohio entitled "Systems Theory and Biology." 



Systems biology 



120 



The 1960s and 1970s saw the development of several approaches to study complex 
molecular systems, such as the Metabolic Control Analysis and the biochemical systems 
theory. The successes of molecular biology throughout the 1980s, coupled with a skepticism 
toward -» theoretical biology, that then promised more than it achieved, caused the 
guantitative modelling of biological processes to become a somewhat minor field. 

However the birth of functional genomics in the 1990s meant that large guantities of high 
guality data became available, while the computing power exploded, making more realistic 
models possible. In 1997, the group of Masaru Tomita published the first guantitative 
model of the metabolism of a whole (hypothetical) cell. 

Around the year 2000, when Institutes of Systems Biology were established in Seattle and 
Tokyo, systems biology emerged as a movement in its own right, spurred on by the 
completion of various genome projects, the large increase in data from the omics (e.g. 
genomics and proteomics) and the accompanying advances in high-throughput experiments 
and bioinformatics. Since then, various research institutes dedicated to systems biology 
have been developed. As of summer 2006, due to a shortage of people in systems biology 
several doctoral training centres in systems biology have been established in many parts of 
the world. 




Techniques associated with systems biology 

According to the interpretation of onmmm . 

System Biology as the ability to 

obtain, integrate and analyze complex 

data from multiple experimental 

sources using interdisciplinary tools, 

some typical technology platforms 

are: 

• Transcriptomics: whole cell or 
tissue gene expression 
measurements by DNA microarrays 
or serial analysis of gene expression 

• Proteomics: complete identification 
of proteins and protein expression 
patterns of a cell or tissue through 
two-dimensional gel electrophoresis 

and mass spectrometry or multi-dimensional protein identification technigues (advanced 
HPLC systems coupled with mass spectrometry). Sub disciplines include 
phosphoproteomics, glycoproteomics and other methods to detect chemically modified 
proteins. 

• Metabolomics: identification and measurement of all small-molecules metabolites within 
a cell or tissue 

• Glycomics: identification of the entirety of all carbohydrates in a cell or tissue. 

In addition to the identification and guantification of the above given molecules further 
technigues analyze the dynamics and interactions within a cell. This includes: 

• -» Interactomics which is used mostly in the context of protein-protein interaction but in 
theory encompasses interactions between all molecules within a cell, 
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• Fluxomics, which deals with the dynamic changes of molecules within a cell over time, 

• Biomics: systems analysis of the biome. 

The investigations are frequently combined with large scale perturbation methods, 
including gene-based (RNAi, mis-expression of wild type and mutant genes) and chemical 
approaches using small molecule libraries. Robots and automated sensors enable such 
large-scale experimentation and data acquisition. These technologies are still emerging and 
many face problems that the larger the quantity of data produced, the lower the quality. A 
wide variety of quantitative scientists (computational biologists, statisticians, 
mathematicians, computer scientists, engineers, and physicists) are working to improve the 
quality of these approaches and to create, refine, and retest the models to accurately 
reflect observations. 

The investigations of a single level of biological organization (such as those listed above) 
are usually referred to as Systematic Systems Biology. Other areas of Systems Biology 
includes Integrative Systems Biology, which seeks to integrate different types of 
information to advance the understanding the biological whole, and Dynamic Systems 
Biology, which aims to uncover how the biological whole changes over time (during 
evolution, for example, the onset of disease or in response to a perturbation). Functional 
Genomics may also be considered a sub-field of Systems Biology. 

The systems biology approach often involves the development of mechanistic models, such 
as the reconstruction of dynamic systems from the quantitative properties of their 
elementary building blocks. For instance, a cellular network can be modelled 

mathematically using methods coming from chemical kinetics and control theory. Due to 
the large number of parameters, variables and constraints in cellular networks, numerical 
and computational techniques are often used. Other aspects of computer science and 
informatics are also used in systems biology. These include new forms of computational 
model, such as the use of process calculi to model biological processes, the integration of 
information from the literature, using techniques of information extraction and text mining, 
the development of online databases and repositories for sharing data and models (such as 
BioModels Database), approaches to database integration and software interoperability via 
loose coupling of software, websites and databases' 151 and the development of syntactically 
and semantically sound ways of representing biological models, such as the Systems 
Biology Markup Language (SBML). 



Systems biology 



122 



See also 

Related fields 

• Complex systems biology 

• Complex systems 

• Complex systems biology 

• Bioinformatics 

• Biological network 
inference 

• Biological systems 
engineering 

• Biomedical cybernetics 

• Biostatistics 

• Theoretical Biophysics 

• Relational Biology 

• Translational Research 

• Computational biology 

• Computational systems 
biology 

• Scotobiology 

• Synthetic biology 

• Systems biology modeling 

• Systems ecology 

• Systems immunology 
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DNA Dynamics 

DNA Molecular dynamics modeling involves simulations of DNA molecular geometry 
and topology changes with time as a result of both intra- and inter- molecular interactions 
of DNA. Whereas molecular models of Deoxyribonucleic acid (DNA) molecules such as 
closely packed spheres (CPK models) made of plastic or metal wires for 'skeletal models' 
are useful representations of static DNA structures, their usefulness is very limited for 
representing complex DNA dynamics. Computer molecular modeling allows both 
animations and molecular dynamics simulations that are very important for understanding 
how DNA functions in vivo. 

An old standing dynamic problem is how DNA "self-replication" takes place in living cells 
that should involve transient uncoiling of supercoiled DNA fibers. Although DNA consists of 
relatively rigid, very large elongated biopolymer molecules called "fibers" or chains its 
molecular structure in vivo undergoes dynamic configuration changes that involve 
dynamically attached water molecules, ions or proteins/enzymes. Supercoiling, packing 
with histones in chromosome structures, and other such supramolecular aspects also 
involve in vivo DNA topology which is even more complex than DNA molecular geometry, 
thus turning molecular modeling of DNA dynamics into a series of challenging problems for 
biophysical chemists, molecular biologists and biotechnologists. Thus, DNA exists in 
multiple stable geometries (called conformational isomerism) and has a rather large 
number of configurational, quantum states which are close to each other in energy on the 
potential energy surface of the DNA molecule. 

Such varying molecular geometries can also be computed, at least in principle, by 
employing ab initio -* quantum chemistry methods that can attain high accuracy for small 
molecules, although claims that acceptable accuracy can be also achieved for 
polynucleotides, as well as DNA conformations, were recently made on the basis of VCD 
spectral data. Such quantum geometries define an important class of ab initio molecular 
models of DNA whose exploration has barely started especially in connection with results 
obtained by VCD in solutions. More detailed comparisons with such ab initio quantum 
computations are in principle obtainable through 2D-FT NMR spectroscopy and relaxation 
studies of polynucleotide solutions or specifically labeled DNA, as for example with 
deuterium labels. 

Importance of DNA molecular structure and dynamics 
modeling for Genomics and beyond 

From the very early stages of structural studies of DNA by X-ray diffraction and 
biochemical means, molecular models such as the Watson-Crick double-helix model were 
successfully employed to solve the 'puzzle' of DNA structure, and also find how the latter 
relates to its key functions in living cells. The first high quality X-ray diffraction patterns of 
A-DNA were reported by Rosalind Franklin and Raymond Gosling in 1953 . The first 
reports of a double-helix molecular model of B-DNA structure were made by Watson and 
Crick in 1953 [2] [3] . Then Maurice F. Wilkins, A. Stokes and H.R. Wilson, reported the first 
X-ray patterns of in vivo B-DNA in partially oriented salmon sperm heads . The 

development of the first correct double-helix molecular model of DNA by Crick and Watson 
may not have been possible without the biochemical evidence for the nucleotide 
base-pairing ([A— T]; [C— G]), or Chargaff's rules [5] [6] [7] [8] [9] [10] . Although such initial 
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studies of DNA structures with the help of molecular models were essentially static, their 
consequences for explaining the in vivo functions of DNA were significant in the areas of 
protein biosynthesis and the quasi-universality of the genetic code. Epigenetic 
transformation studies of DNA in vivo were however much slower to develop in spite of 
their importance for embryology, morphogenesis and cancer research. Such chemical 
dynamics and biochemical reactions of DNA are much more complex than the molecular 
dynamics of DNA physical interactions with water, ions and proteins/enzymes in living cells. 



Animated DNA molecular models and hydrogen-bonding 

Animated molecular models allow one to visually explore the three-dimensional (3D) 
structure of DNA. The first DNA model is a space-filling, or CPK, model of the DNA 
double-helix whereas the third is an animated wire, or skeletal type, molecular model of 

DNA. The last two DNA molecular models in this series depict quadruplex DNA that 

ri2i r 131 
may be involved in certain cancers . The first CPK model in the second row is a 

molecular model of hydrogen bonds between water molecules in ice that are broadly similar 

to those found in DNA; the hydrogen bonding dynamics and proton exchange is however 

very different by many orders of magnitude between the two systems of fully hydrated DNA 

and water molecules in ice. Thus, the DNA dynamics is complex, involving nanosecond and 

several tens of picosecond time scales, whereas that of liquid ice is on the picosecond time 

scale, and that of proton exchange in ice is on the millisecond time scale; the proton 

exchange rates in DNA and attached proteins may vary from picosecond to nanosecond, 

minutes or years, depending on the exact locations of the exchanged protons in the large 

biopolymers. The simple harmonic oscillator 'vibration' in the third, animated image of the 

next gallery is only an oversimplified dynamic representation of the longitudinal vibrations 

of the DNA intertwined helices which were found to be anharmonic rather than harmonic as 

often assumed in quantum dynamic simulations of DNA. 
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Human Genomics and Biotechnology Applications of DNA 
Molecular Modeling 

The following two galleries of images illustrate various uses of DNA molecular modeling in 
Genomics and Biotechnology research applications from DNA repair to PCR and DNA 
nanostructures; each slide contains its own explanation and/or details. The first slide 
presents an overview of DNA applications, including DNA molecular models, with emphasis 
on Genomics and Biotechnology. 



Applications of DNA molecular dynamics computations 

• First row images present a DNA biochip and DNA nanostructures designed for DNA 
computing and other dynamic applications of DNA nanotechnology; last image in this row 
is of DNA arrays that display a representation of the Sierpinski gasket on their surfaces. 

• Second row. the first two images show computer molecular models of RNA polymerase, 
followed by that of an E. coli, bacterial DNA primase template suggesting very complex 
dynamics at the interfaces between the enzymes and the DNA template; the fourth image 
illustrates in a computed molecular model the mutagenic, chemical interaction of a 
potent carcinogen molecule with DNA, and the last image shows the different 
interactions of specific fluorescence labels with DNA in human and orangoutan 
chromosomes. 
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Image Gallery: DNA Applications and Technologies at various scales 
in Biotechnology and Genomics research 

The first figure is an actual electron micrograph of a DNA fiber bundle, presumably of a 
single plasmid, bacterial DNA loop. 
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Databases for Genomics, DNA Dynamics and Sequencing 

Genomic and structural databases 

• CBS Genome Atlas Database — contains examples of base skews. 

• The Z curve database of genomes — a 3-dimensional visualization and analysis tool of 
genomes [16][17] . 

• DNA and other nucleic acids' molecular models: Coordinate files of nucleic acids 

ri8i 
molecular structure models in PDB and CIF formats 

Mass spectrometry—Maldi informatics 



Data acquisition 




Genotype, 
mutations, etc. 



IMudeic add 
quantitation 



DNA Dynamics Data from Spectroscopy 

• FT-NMR [19] [20] 

• NMR Atlas-database [21] 

• mmcif downloadable coordinate files of nucleic acids in solution from 2D-FT NMR data 

[22] 

• NMR constraints files for NAs in PDB format [23] 
NMR microscopy [24] 
Vibrational circular dichroism (VCD) 
Microwave spectroscopy 
FT-IR 

FT-NIR [25] [26] [27] 

Spectral Hyperspectral, and Chemical imaging) [28] [29] [30] [31] [32] [33] [34] . 
Raman spectroscopy/microscopy and CARS . 

Fluorescence correlation spectroscopy , Fluorescence 

cross-correlation spectroscopy and FRET [ ] [ ] [ . 



Confocal microscopy 



[48] 
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Gallery: CARS (Raman spectroscopy), Fluorescence confocal 
microscopy, and Hyperspectral imaging 
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X-ray microscopy 

• Application of X-ray microscopy in the analysis of living hydrated cells 



[49] 



Atomic Force Microscopy (AFM) 

Two-dimensional DNA junction arrays have been visualized by Atomic Force Microscopy 
(AFM) [ ] . Other imaging resources for AFM/Scanning probe microscopy(SPM) can be 
freely accessed at: 

• How SPM Works [51] 

• SPM Image Gallery - AFM STM SEM MFM NSOM and more. [52] 

Gallery of AFM Images of DNA Nanostructures 
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External links 

• DNAlive: a web interface to compute DNA physical properties (http://mmb.pcb.ub.es/ 
DNAIive). Also allows cross-linking of the results with the UCSC Genome browser and 
DNA dynamics. 

• Application of X-ray microscopy in analysis of living hydrated cells (http://www.ncbi. 
nlm.nih.gov/entrez/query.fcgi?cmd = Retrieve&db=pubmed&dopt=Abstract& 
list_uids=12379938) 

• DiProDB: Dinucleotide Property Database (http://diprodb.fli-leibniz.de). The database is 
designed to collect and analyse thermodynamic, structural and other dinucleotide 
properties. 

• DNA the Double Helix Game (http://nobelprize.org/educational_games/medicine/ 
dna_double_helix/) From the official Nobel Prize web site 

• MDDNA: Structural Bioinformatics of DNA (http://humphry.chem. wesleyan.edu:8080/ 
MDDNA/) 

• Double Helix 1953-2003 (http://www.ncbe.reading.ac.uk/DNA50/) National Centre 
for Biotechnology Education 

• DNA under electron microscope (http://www.fidelitysystems.com/Unlinked_DNA.html) 

• Further details of mathematical and molecular analysis of DNA structure based on X-ray 
data (http://planetphysics.org/encyclopedia/ 
BesselFunctionsApplicationsToDiffractionByHelicalStructures.html) 

• Bessel functions corresponding to Fourier transforms of atomic or molecular helices. 
(http://planetphysics.org/?op=getobj&from=objects& 
name= Bessel FunctionsAndTheirApplicationsToDiffractionByHelicalStructu res) 

• Characterization in nanotechnology some pdfs (http://nanocharacterization.sitesled. 
com/) 

• An overview of STM/AFM/SNOM principles with educative videos (http://www.ntmdt. 
ru/SPM-Techniques/Principles/) 

• SPM Image Gallery - AFM STM SEM MFM NSOM and More (http://www.rhk-tech.com/ 
resu Its/showcase, php) 

• How SPM Works (http://www.parkafm.com/New_html/resources/01general.php) 

• U.S. National DNA Day (http://www.genome.gov/10506367) — watch videos and 
participate in real-time discusssions with scientists. 
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• The Secret Life of DNA - DNA Music compositions (http://www.tjmitchell.com/stuart/ 
dna.html) 

• Ascalaph DNA (http://www.agilemolecule.com/Ascalaph/Ascalaph_DNA.html) — 
Commercial software for DNA modeling 

Metastability in the brain 

In the field of computational neuroscience, the theory of metastability refers to the human 
brain's ability to integrate several functional parts and to produce neural oscillations in a 
cooperative and coordinated manner, providing the basis for conscious activity. 

Metastability, a state in which signals (such as oscillatory waves), fall outside their natural 
equilibrium state but persist for an extended period of time, is a principle that describes the 
brain's ability to make sense out of seemingly random environmental cues. In the past 25 
years, interest in metastability and the underlying framework of nonlinear dynamics has 
been fueled by advancements in the methods by which computers model brain activity. 

Overview 

EEG and EMG are used to model brain output as waveforms. In the metastability theory, 
EEG outputs produce oscillations that can be described as having identifiable patterns that 
correlate with each other at certain frequencies. Each neuron in a neuronal network 
normally outputs a dynamical oscillatory waveform, but also has the ability to output a 
chaotic waveform . When neurons are integrated into the neural network by interfacing 
neurons with each other, the dynamical oscillations created by each neuron can be 
combined to form highly predictable EEG oscillations. 

By identifying these correlations and the individual neurons that contribute to predictable 
EEG oscillations, scientists can determine which cortical domains are processing in parallel 
and which neuronal networks are intertwined. In many cases, metastability describes 
instances in which distal parts of the brain interact with each other to respond to 
environmental stimuli. 

Frequency Domains of Metastability 

It has been suggested that one integral facet of brain dynamics underlying conscious 
thought is the brain's ability to convert seemingly noisy or chaotic signals into predictable 
oscillatory patterns . 

In EEG oscillations of neural networks, neighboring waveform frequencies are correlated 
on a logarithmic scale rather than a linear scale. As a result, mean frequencies in 
oscillatory bands cannot link together according to linearity of their mean frequencies. 
Instead, phase transitions are linked according to their ability to couple with adjacent 
phase shifts in a constant state of transition between unstable and stable phase 
synchronization^ ] . This phase synchronization forms the basis of metastable behavior in 
neural networks. 

Metastable behavior occurs at the high frequency domain known as 1/f regime. This 
regime describes an environment in which a noisy signal (also known as pink noise) has 
been induced, where the amount of power the signal outputs over a certain bandwidth (its 
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power spectral density) is inversely proportional to its frequency. 

Noise at the 1// regime can be found in many biological systems - for instance, in the output 
of a heartbeat in an ECG waveform -- but serves a unique purpose for phase synchrony in 
neural networks. At the 1/f regime, the brain is in the critical state necessary for a 
conscious response to weak or chaotic environmental signals because it can shift the 
random signals into identifiable and predictable oscillatory waveforms . While often 
transient, these waveforms exist in a stable form long enough to contribute to what can be 
thought of as conscious response to environmental stimuli. 



Theories of Metastability 

Oscillatory Activity and Coordination Dynamics 

The dynamical system model, which represents networks composed of integrated neural 
systems communicating with one another between unstable and stable phases, has become 
an increasingly popular theory underpinning the understanding of metastability 
Coordination dynamics forms the basis for this dynamical system model by describing 
mathematical formulae and paradigms governing the coupling of environmental stimuli to 
their effectors . 



History of Coordination Dynamics and the Haken-Kelso-Bunz (HKB) Model 

The so-named HKB model is one of the earliest and well-respected theories to describe 
coordination dynamics in the brain. In this model, the formation of neural networks can be 
partly described as self-organization, where individual neurons and small neuronal systems 
aggregate and coordinate to either adapt or respond to local stimuli or to divide labor and 
specialize in function . 

In the last 20 years, the HKB 
model has become a 
widely-accepted theory to 
explain the coordinated 
movements and behaviors of 
individual neurons into large, 
end-to-end neural networks. 
Originally the model described 
a system in which spontaneous 
transitions observed in finger 
movements could be described 
as a series of in-phase and 




parallel 



antiparallel 



out-of-phase movements 



[6] 



Transition of parallel movement of index fingers to antiparallel, 
symmetric movement. 



In the mid-1980s HKB model experiments, subjects were asked to wave one finger on each 
hand in two modes of direction: first, known as out of phase, both fingers moving in the 
same direction back and forth (as windshield wipers might move); and second, known as 
in-phase, where both fingers come together and move away to and from the midline of the 
body. To illustrate coordination dynamics, the subjects were asked to move their fingers out 
of phase with increasing speed until their fingers were moving as fast as possible. As 
movement approached its critical speed, the subjects' fingers were found to move from 
out-of-phase (windshield-wiper-like) movement to in-phase (toward midline movement). 
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The HKB model, which has also been elucidated by several complex mathematical 
descriptors, is still a relatively simple but powerful way to describe seemingly-independent 
systems that come to reach synchrony just before a state of self-organized criticality ''■'. 

Evolution of Cognitive Coordination Dynamics 

In the last 10 years, the HKB model has been reconciled with advanced mathematical 
models and supercomputer-based computation to link rudimentary coordination dynamics 
to higher-order processes such as learning and memory. 

The traditional EEG is still useful to investigate coordination between different parts of the 
brain. 40 Hz gamma wave activity is a prominent example of the brain's ability to be 
modeled dynamically and is a common example of coordination dynamics. Continuous study 
of these and other oscillations has led to an important conclusion: analyzing waves as 
having a common signal phase but a different amplitude leads to the possibility that these 
different signals serve a synergistic function ^ ' . 

It is interesting to note some unusual characteristics of these waves: they are virtually 
simultaneous and have a very short onset latency, which implies that they operate faster 
than synaptic conduction would allow; and that their recognizable patterns are sometimes 
interrupted by periods of randomness. The latter idiosyncrasy has served as the basis for 
assuming an interaction and transition between neural subsystems. Analysis of activation 
and deactivation of regions of the cortex has shown a dynamic shift between dependence 
and interdependence, reflecting the brain's metastable nature as a function of a 
coordinated dynamical system. 

fMRI, large-scale electrode arrays, and MEG expand upon the patterns seen in EEG by 
providing visual confirmation of coordinated dynamics. The fMRI, which provides an 
improvement over EEG in spatiotemporal characterization, allows researchers to stimulate 
certain parts of the brain with environmental cues and observe the response in a holistic 
brain model. Additionally, the fMRI has a response time of about one millisecond, allowing 
for a virtually real-time investigation of the active turning -on and -off of selected parts of 

rq] 

the brain in response to environmental cues and conscious tasks . 

Social Coordination Dynamics and the Phi Complex 

A developing field in coordination dynamics involves the theory of social coordination, 
which attempts to relate the DC to normal human development of complex social cues 
following certain patterns of interaction. This work is aimed at understanding how human 
social interaction is mediated by metastability of neural networks. fMRI and EEG are 
particularly useful in mapping thalamocortical response to social cues in experimental 
studies. 

A new theory called the Phi complex has been developed by J. A. Scott Kelso and fellow 
researchers at Florida Atlantic University to provide experimental results for the theory of 
social coordination dynamics . In Kelso's experiments, two subjects were separated by 
an opaque barrier and asked to wag their fingers; then the barrier was removed and the 
subjects were instructed to continue to wag their fingers as if no change had occurred. 
After a short period, the movements of the two subjects sometimes became coordinated and 
synchronized (but other times continued to be asynchronous). The link between EEG and 
conscious social interaction is described as Phi, one of several brain rhythms operating in 
the 10 Hz range. Phi consists of two components: one to favor solitary behavior and another 
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to favor interactive (interpersonal) behavior. Further analysis of Phi may reveal the social 
and interpersonal implications of degenerative diseases such as schizophrenia -- or may 
provide insight into common social relationships such as the dynamics of alpha and 
omega-males or the popular bystander effect describing how people diffuse personal 
responsibility in emergency situations depending on the number of other individuals 
present. 

The Dynamic Core 

A second theory of metastability involves a so-called dynamic core, which is a term to 
loosely describe the thalamocortical region believed to be the integration center of 
consciousness. The dynamic core hypothesis (DCH) reflects the use and disuse of 
interconnected neuronal networks during stimulation of this region. A computer model of 

roi 

65,000 spiking neurons shows that neuronal groups existing in the cortex and thalamus 
interact in the form of synchronous oscillation. The interaction between distinct neuronal 
groups forms the dynamic core and may help explain the nature of conscious experience. A 
critical feature of the DCH is that instead of thinking binarily about transitions between 
neural integration and non-integration (i.e., that the two are either one or the other with no 
in-between), the metastable nature of the dynamic core can allow for a continuum of 

roi 

integration . 

Neural Darwinism 

One theory used to integrate the dynamic core with conscious thought involves a 
developing concept known as neural Darwinism . In this model, metastable interactions 
in the thalamocortical region cause a process of selectionism through re-entry (a 
phenomenon describing the overall reciprocity and interactivity between signals in distant 
parts of the brain through coupled signal latency). Neuronal selectivity involves 

mechanochemical events that take place pre- and post-natally whereby neuronal 

ri2i 
connections are influenced by environmental experiences . The modification of synaptic 

signals as it relates to the dynamic core provides further explanation for the DCH. 

Despite growing evidence for the DCH, the ability to generate mathematical constructs to 

ri3i 
model and predict dynamic core behavior has been slow to progress . Continued 

development of stochastic processes designed to graph neuronal signals as chaotic and 

non-linear has provided some algorithmic basis for analyzing how chaotic environmental 

signals are coupled to enhance selectivity of neural outgrowth or coordination in the 

dynamic core. 

The Global Workspace Hypothesis 

The global workspace hypothesis is another theory to elucidate metastability, and has 
existed in some form since 1983 . This hypothesis focuses again on re-entry, the ability 

r o "1 

of a routine or process to be used by multiple parts of the brain simultaneously . Both the 
DC and global neuronal workspace (GNW) models involve re-entrance, but the GNW model 
elaborates on re-entrant connectivity between distant parts of the brain and long-range 
signal flow. Workspace neurons are similar anatomically but separated spatially from each 
other. 

One interesting aspect of the GNW is that with sufficient intensity and length over which a 
signal travels, a small initiation signal can be compounded to activate an "ignition" of a 
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critical spike-inducing state. This idea is analogous to a skier on the slope of a mountain, 
who, by disrupting a few blocks of ice with his skis, initiates a giant avalanche in his wake. 
To help prove the circuit-like amplification theory, research has shown that inducing lesions 

roi 

in long-distance connections corrupts performance in integrative models . 

A popular experiment to demonstrate the global workspace hypothesis involves showing a 
subject a series of backward-masked visual words (e.g., "the dog sleeps quietly" is shown as 
"ylteiuq speels god eht") and then asking the subject to identify the forward "translation" of 
these words. Not only did fMRI detect activity in the word-recognition portion of the cortex, 
but additionally, activity is often detected in the parietal and prefrontal cortices . In 
almost every experiment, conscious input in word and audition tasks shows a much wider 
use of integrated portions of the brain than in identical unconscious input. The wide 
distribution and constant signal transfer between different areas of the brain in 
experimental results is a common method to attempt to prove the neural workspace 
hypothesis. More studies are being conducted to determine precisely the correlation 
between conscious and unconscious task deliberation in the realm of the global workspace. 

The Operational Architectonics Theory of Brain- Mind 

Although the concept of metastability has been around in the Neuroscience for some time 
, the specific interpretation of metastability in the context of brain operations of 
different complexity has been developed by Andrew and Alexander Fingelkurts. 
Metastability is basically a theory of how global integrative and local segregative 
tendencies coexist in the brain . The Operational Architectonics is centered around 

the fact that in the metastable regime of brain functioning, the individual parts of the brain 
exhibit tendencies to function autonomously at the same time as they exhibit tendencies for 
coordinated activity .In accrodence with Operational Architectonics , the 

synchronized operations produced by distributed neuronal assemblies constitute the 
metastable spatial-temporal patterns. They are metastable because intrinsic differences in 
the activity between neuronal assemblies are sufficiently large that they do each their own 
job (operation), while still retaining a tendency to be coordinated together in order to 

["221 T231 

realize the complex brain operation 

The Future of Metastability 

In addition to study investigating the effects of metastable interactions on traditional social 
function, much research will likely focus on determining the role of the coordinated 
dynamic system and the global workspace in the progression of debilitating diseases such 
as Alzheimer's Disease, Parkinson's Disease, stroke, and schizophrenia . Undoubtedly, 
spatiotemporal imaging techniques such as MEG and fMRI will elaborate on results already 
gleaned from analysis of EEG output. 

An interest in the effect of a traumatic or semi-traumatic brain injury (TBI) on the 
coordinated dynamical system has developed in the last five years as the number of TBI 
cases has risen from war-related injuries. 
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See also 

• Consciousness 

• Cognitive Psychology 

• Computational Neuroscience 

• Electroencephalogram 

• Functional MRI 

• Magnetoencephalography 

• Neural Darwinism 

• Self-organization 
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External links 

• The Human Brain and Behavior Laboratory (http://www.ccs.fau.edu/section_links/ 
HBBI_v2/) - Florida Atlantic University 

• Laboratory for Coordination Dynamics (http://www.ccs.fau.edu/html_old/coordlab. 
htm/) - Florida Atlantic University 

• BM-Science (http://www.bm-science.com/) - Brain & Mind Technologies Research 
Centre, Finland 

• An update on the global workspace theory (http://cogweb.ucla.edu/CogSci/ 

Baa rs-update_03.htm I/) - article discussing the global workspace theory in the context of 
metastability in the brain. 
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Interactomics is a discipline at the intersection of bioinformatics and biology that deals 
with studying both the interactions and the consequences of those interactions between 
and among proteins, and other molecules within a cell . The network of all such 
interactions is called the Interactome. Interactomics thus aims to compare such networks of 
interactions (i.e., interactomes) between and within species in order to find how the traits 
of such networks are either preserved or varied. From a mathematical, or -» mathematical 
biology viewpoint an interactome network is a graph or a category representing the most 
important interactions pertinent to the normal physiological functions of a cell or organism. 

Interactomics is an example of "top-down" systems biology, which takes an overhead, as 
well as overall, view of a biosystem or organism. Large sets of genome-wide and proteomic 
data are collected, and correlations between different molecules are inferred. From the 
data new hypotheses are formulated about feedbacks between these molecules. These 
hypotheses can then be tested by new experiments . 

Through the study of the interaction of all of the molecules in a cell the field looks to gain a 
deeper understanding of genome function and evolution than just examining an individual 
genome in isolation . Interactomics goes beyond cellular proteomics in that it not only 
attempts to characterize the interaction between proteins, but between all molecules in the 
cell. 

Methods of interactomics 

The study of the interactome requires the collection of large amounts of data by way of high 
throughput experiments. Through these experiments a large number of data points are 
collected from a single organism under a small number of perturbations These 
experiments include: 

• Two-hybrid screening 

• Tandem Affinity Purification 

• X-ray tomography 

• Optical fluorescence microscopy 
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Recent developments 

The field of interactomics is currently rapidly expanding and developing. While no 
biological interactomes have been fully characterized. Over 90% of proteins in 
Saccharomyces cerevisiae have been screened and their interactions characterized, making 
it the first interactome to be nearly fully specified 



[3] 



Also there have been recent systematic attempts to explore the human interactome 

[4] 
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and 
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Metabolic Network Model for Escherichia coli. 



Other species whose interactomes have been studied in some detail include Caenorhabditis 
elegans and Drosophila melanogaster. 



Criticisms and concerns 

Kiemer and Cesareni raise the following concerns with the current state of the field: 

• The experimental procedures associated with the field are error prone leading to "noisy 
results". This leads to 30% of all reported interactions being artifacts. In fact, two groups 
using the same techniques on the same organism found less than 30% interactions in 
common. 

• Techniques may be biased, i.e. the technique determines which interactions are found. 

• Ineractomes are not nearly complete with perhaps the exception of S. cerivisiae. 

• While genomes are stable, interactomes may vary between tissues and developmental 
stages. 
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• Genomics compares amino acids, and nucleotides which are in a sense unchangeable, but 
interactomics compares proteins and other molecules which are subject to mutation and 
evolution. 

• It is difficult to match evolutionarily related proteins in distantly related species. 

See also 

Interaction network 

Proteomics 

Metabolic network 

Metabolic network modelling 

Metabolic pathway 

Genomics 

-» Mathematical biology 

-* Systems biology 
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External links 

• Interactomics.org (http://interactomics.org). A dedicated interactomics web site 
operated under BioLicense. 

• Interactome.org (http://interactome.org). An interactome wiki site. 

• PSIbase (http://psibase.kobic.re.kr) Structural Interactome Map of all Proteins. 

• Omics.org (http://omics.org). An omics portal site that is openfree (under BioLicense) 

• Genomics.org (http://genomics.org). A Genomics wiki site. 

• Comparative Interactomics analysis of protein family interaction networks using PSIMAP 
(protein structural interactome map) (http://bioinformatics.oxfordjournals.org/cgi/ 
content/full/2 1/15/3234) 

• Interaction interfaces in proteins via the Voronoi diagram of atoms (http://www. 
sdencedirect.com/sdence?_ob=ArtideURL&_udi=B6TYR-4KXVD30-2&_user=10& 
_coverDate=ll/3u/2006&_rdoc=l&_fmt=&_orig=search&_sort=d&view=c& 
_acct=C0uu050221&_version=l&_urlVersion = u&_userid = 10& 
md5=8361bf3fe7834b4642cdda3b979de8bb) 

• Using convex hulls to extract interaction interfaces from known structures. Panos Dafas, 
Dan Bolser, Jacek Gomoluch, Jong Park, and Michael Schroeder. Bioinformatics 2004 20: 
1486-1490. 

• PSIbase: a database of Protein Structural Interactome map (PSIMAP). Sungsam Gong, 
Giseok Yoon, Insoo Jang Bioinformatics 2005. 

• Mapping Protein Family Interactions : Intramolecular and Intermolecular Protein Family 
Interaction Repertoires in the PDB and Yeast, Jong Park, Michael Lappe & Sarah A. 
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TeichmannJ.M.B (2001). 
• Semantic Systems Biology (http://www.semantic-systems-biology.org) 
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License 

Version 1.2, November 2002 

Copyright (C) 2000,2001,2002 Free Software Foundation, Inc. 
51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA 
Everyone is permitted to copy and distribute verbatim copies 
of this license document, but changing it is not allowed. 

0. PREAMBLE 

The purpose of this License is to make a manual, textbook, or other functional and useful document "free" in the sense of freedom: to assure everyone 
the effective freedom to copy and redistribute it, with or without modifying it, either commercially or noncommercially. Secondarily, this License 
preserves for the author and publisher a way to get credit for their work, while not being considered responsible for modifications made by others. 
This License is a kind of "copyleft", which means that derivative works of the document must themselves be free in the same sense. It complements the 
GNU General Public License, which is a copyleft license designed for free software. 

We have designed this License in order to use it for manuals for free software, because free software needs free documentation: a free program should 
come with manuals providing the same freedoms that the software does. But this License is not limited to software manuals; it can be used for any 
textual work, regardless of subject matter or whether it is published as a printed book. We recommend this License principally for works whose purpose 
is instruction or reference. 

INAPPLICABILITY AND DEFINITIONS 

This License applies to any manual or other work, in any medium, that contains a notice placed by the copyright holder saying it can be distributed under 

the terms of this License. Such a notice grants a world-wide, royalty-free license, unlimited in duration, to use that work under the conditions stated 

herein. The "Document", below, refers to any such manual or work. Any member of the public is a licensee, and is addressed as "you". You accept the 

license if you copy, modify or distribute the work in a way reguiring permission under copyright law. 

A "Modified Version" of the Document means any work containing the Document or a portion of it, either copied verbatim, or with modifications and/or 

translated into another language. 

A "Secondary Section" is a named appendix or a front-matter section of the Document that deals exclusively with the relationship of the publishers or 

authors of the Document to the Document's overall subject (or to related matters) and contains nothing that could fall directly within that overall subject. 

(Thus, if the Document is in part a textbook of mathematics, a Secondary Section may not explain any mathematics.) The relationship could be a matter 

of historical connection with the subject or with related matters, or of legal, commercial, philosophical, ethical or political position regarding them. 

The "Invariant Sections" are certain Secondary Sections whose titles are designated, as being those of Invariant Sections, in the notice that says that the 

Document is released under this License. If a section does not fit the above definition of Secondary then it is not allowed to be designated as Invariant. 

The Document may contain zero Invariant Sections. If the Document does not identify any Invariant Sections then there are none. 

The "Cover Texts" are certain short passages of text that are listed, as Front-Cover Texts or Back-Cover Texts, in the notice that says that the Document 

is released under this License. A Front-Cover Text may be at most 5 words, and a Back-Cover Text may be at most 25 words. 

A "Transparent" copy of the Document means a machine-readable copy, represented in a format whose specification is available to the general public, 

that is suitable for revising the document straightforwardly with generic text editors or (for images composed of pixels) generic paint programs or (for 

drawings) some widely available drawing editor, and that is suitable for input to text formatters or for automatic translation to a variety of formats 

suitable for input to text formatters. A copy made in an otherwise Transparent file format whose markup, or absence of markup, has been arranged to 

thwart or discourage subseguent modification by readers is not Transparent. An image format is not Transparent if used for any substantial amount of 

text. A copy that is not "Transparent" is called "Opaque". 

Examples of suitable formats for Transparent copies include plain ASCII without markup, Texinfo input format, LaTeX input format, SGML or XML using 

a publicly available DTD, and standard-conforming simple HTML, PostScript or PDF designed for human modification. Examples of transparent image 

formats include PNG, XCF and JPG. Opaque formats include proprietary formats that can be read and edited only by proprietary word processors, SGML 

or XML for which the DTD and/or processing tools are not generally available, and the machine-generated HTML, PostScript or PDF produced by some 

word processors for output purposes only. 

The "Title Page" means, for a printed book, the title page itself, plus such following pages as are needed to hold, legibly, the material this License 

requires to appear in the title page. For works in formats which do not have any title page as such, "Title Page" means the text near the most prominent 

appearance of the work's title, preceding the beginning of the body of the text. 

A section "Entitled XYZ" means a named subunit of the Document whose title either is precisely XYZ or contains XYZ in parentheses following text that 

translates XYZ in another language. (Here XYZ stands for a specific section name mentioned below, such as "Acknowledgements", "Dedications", 

"Endorsements", or "History".) To "Preserve the Title" of such a section when you modify the Document means that it remains a section "Entitled XYZ" 

according to this definition. 

The Document may include Warranty Disclaimers next to the notice which states that this License applies to the Document. These Warranty Disclaimers 

are considered to be included by reference in this License, but only as regards disclaiming warranties: any other implication that these Warranty 

Disclaimers may have is void and has no effect on the meaning of this License. 

2. VERBATIM COPYING 

You may copy and distribute the Document in any medium, either commercially or noncommercially, provided that this License, the copyright notices, 
and the license notice saying this License applies to the Document are reproduced in all copies, and that you add no other conditions whatsoever to 
those of this License. You may not use technical measures to obstruct or control the reading or further copying of the copies you make or distribute. 
However, you may accept compensation in exchange for copies. If you distribute a large enough number of copies you must also follow the conditions in 
section 3. 
You may also lend copies, under the same conditions stated above, and you may publicly display copies. 

3. COPYING IN QUANTITY 

If you publish printed copies (or copies in media that commonly have printed covers) of the Document, numbering more than 100, and the Document's 
license notice requires Cover Texts, you must enclose the copies in covers that carry, clearly and legibly, all these Cover Texts: Front-Cover Texts on the 
front cover, and Back-Cover Texts on the back cover. Both covers must also clearly and legibly identify you as the publisher of these copies. The front 
cover must present the full title with all words of the title equally prominent and visible. You may add other material on the covers in addition. Copying 
with changes limited to the covers, as long as they preserve the title of the Document and satisfy these conditions, can be treated as verbatim copying in 
other respects. 

If the required texts for either cover are too voluminous to fit legibly, you should put the first ones listed (as many as fit reasonably) on the actual cover, 
and continue the rest onto adjacent pages. 

If you publish or distribute Opaque copies of the Document numbering more than 100, you must either include a machine-readable Transparent copy 
along with each Opaque copy, or state in or with each Opaque copy a computer-network location from which the general network-using public has 
access to download using public-standard network protocols a complete Transparent copy of the Document, free of added material. If you use the latter 
option, you must take reasonably prudent steps, when you begin distribution of Opaque copies in quantity, to ensure that this Transparent copy will 
remain thus accessible at the stated location until at least one year after the last time you distribute an Opaque copy (directly or through your agents or 
retailers) of that edition to the public. 

It is requested, but not required, that you contact the authors of the Document well before redistributing any large number of copies, to give them a 
chance to provide you with an updated version of the Document. 

4.MODIFICATIONS 

You may copy and distribute a Modified Version of the Document under the conditions of sections 2 and 3 above, provided that you release the Modified 
Version under precisely this License, with the Modified Version filling the role of the Document, thus licensing distribution and modification of the 
Modified Version to whoever possesses a copy of it. In addition, you must do these things in the Modified Version: 

1. Use in the Title Page (and on the covers, if any) a title distinct from that of the Document, and from those of previous versions (which should, if there 
were any, be listed in the History section of the Document). You may use the same title as a previous version if the original publisher of that version 
gives permission. 

2. List on the Title Page, as authors, one or more persons or entities responsible for authorship of the modifications in the Modified Version, together 
with at least five of the principal authors of the Document (all of its principal authors, if it has fewer than five), unless they release you from this 
requirement. 

3. State on the Title page the name of the publisher of the Modified Version, as the publisher. 

4. Preserve all the copyright notices of the Document. 

5. Add an appropriate copyright notice for your modifications adjacent to the other copyright notices. 
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6. Include, immediately after the copyright notices, a license notice giving the public permission to use the Modified Version under the terms of this 
License, in the form shown in the Addendum below. 

7. Preserve in that license notice the full lists of Invariant Sections and reguired Cover Texts given in the Document's license notice. 

8. Include an unaltered copy of this License. 

9. Preserve the section Entitled "History", Preserve its Title, and add to it an item stating at least the title, year, new authors, and publisher of the 
Modified Version as given on the Title Page. If there is no section Entitled "History" in the Document, create one stating the title, year, authors, and 
publisher of the Document as given on its Title Page, then add an item describing the Modified Version as stated in the previous sentence. 

10. Preserve the network location, if any, given in the Document for public access to a Transparent copy of the Document, and likewise the network 
locations given in the Document for previous versions it was based on. These may be placed in the "History" section. You may omit a network 
location for a work that was published at least four years before the Document itself, or if the original publisher of the version it refers to gives 
permission. 

11. For any section Entitled "Acknowledgements" or "Dedications", Preserve the Title of the section, and preserve in the section all the substance and 
tone of each of the contributor acknowledgements and/or dedications given therein. 

12. Preserve all the Invariant Sections of the Document, unaltered in their text and in their titles. Section numbers or the eguivalent are not considered 
part of the section titles. 

13. Delete any section Entitled "Endorsements". Such a section may not be included in the Modified Version. 

14. Do not retitle any existing section to be Entitled "Endorsements" or to conflict in title with any Invariant Section. 

15. Preserve any Warranty Disclaimers. 

If the Modified Version includes new front-matter sections or appendices that gualify as Secondary Sections and contain no material copied from the 

Document, you may at your option designate some or all of these sections as invariant. To do this, add their titles to the list of Invariant Sections in the 

Modified Version's license notice. These titles must be distinct from any other section titles. 

You may add a section Entitled "Endorsements", provided it contains nothing but endorsements of your Modified Version by various parties-for example, 

statements of peer review or that the text has been approved by an organization as the authoritative definition of a standard. 

You may add a passage of up to five words as a Front-Cover Text, and a passage of up to 25 words as a Back-Cover Text, to the end of the list of Cover 

Texts in the Modified Version. Only one passage of Front-Cover Text and one of Back-Cover Text may be added by (or through arrangements made by) 

any one entity. If the Document already includes a cover text for the same cover, previously added by you or by arrangement made by the same entity 

you are acting on behalf of, you may not add another; but you may replace the old one, on explicit permission from the previous publisher that added the 

old one. 

The author(s) and publisher(s) of the Document do not by this License give permission to use their names for publicity for or to assert or imply 

endorsement of any Modified Version. 

5. COMBINING DOCUMENTS 

You may combine the Document with other documents released under this License, under the terms defined in section 4 above for modified versions, 

provided that you include in the combination all of the Invariant Sections of all of the original documents, unmodified, and list them all as Invariant 

Sections of your combined work in its license notice, and that you preserve all their Warranty Disclaimers. 

The combined work need only contain one copy of this License, and multiple identical Invariant Sections may be replaced with a single copy. If there are 

multiple Invariant Sections with the same name but different contents, make the title of each such section unigue by adding at the end of it, in 

parentheses, the name of the original author or publisher of that section if known, or else a unigue number. Make the same adjustment to the section 

titles in the list of Invariant Sections in the license notice of the combined work. 

In the combination, you must combine any sections Entitled "History" in the various original documents, forming one section Entitled "History"; likewise 

combine any sections Entitled "Acknowledgements", and any sections Entitled "Dedications". You must delete all sections Entitled "Endorsements." 

6. COLLECTIONS OF DOCUMENTS 

You may make a collection consisting of the Document and other documents released under this License, and replace the individual copies of this 
License in the various documents with a single copy that is included in the collection, provided that you follow the rules of this License for verbatim 
copying of each of the documents in all other respects. 

You may extract a single document from such a collection, and distribute it individually under this License, provided you insert a copy of this License into 
the extracted document, and follow this License in all other respects regarding verbatim copying of that document. 

7.AGGREGATION WITH INDEPENDENT WORKS 

A compilation of the Document or its derivatives with other separate and independent documents or works, in or on a volume of a storage or distribution 
medium, is called an "aggregate" if the copyright resulting from the compilation is not used to limit the legal rights of the compilation's users beyond 
what the individual works permit. When the Document is included in an aggregate, this License does not apply to the other works in the aggregate which 
are not themselves derivative works of the Document. 

If the Cover Text reguirement of section 3 is applicable to these copies of the Document, then if the Document is less than one half of the entire 
aggregate, the Document's Cover Texts may be placed on covers that bracket the Document within the aggregate, or the electronic eguivalent of covers 
if the Document is in electronic form. Otherwise they must appear on printed covers that bracket the whole aggregate. 

8.TRANSLATION 

Translation is considered a kind of modification, so you may distribute translations of the Document under the terms of section 4. Replacing Invariant 
Sections with translations requires special permission from their copyright holders, but you may include translations of some or all Invariant Sections in 
addition to the original versions of these Invariant Sections. You may include a translation of this License, and all the license notices in the Document, 
and any Warranty Disclaimers, provided that you also include the original English version of this License and the original versions of those notices and 
disclaimers. In case of a disagreement between the translation and the original version of this License or a notice or disclaimer, the original version will 
prevail. 

If a section in the Document is Entitled "Acknowledgements", "Dedications", or "History", the reguirement (section 4) to Preserve its Title (section 1) will 
typically require changing the actual title. 

9.TERMINATION 

You may not copy, modify, sublicense, or distribute the Document except as expressly provided for under this License. Any other attempt to copy, modify, 
sublicense or distribute the Document is void, and will automatically terminate your rights under this License. However, parties who have received 
copies, or rights, from you under this License will not have their licenses terminated so long as such parties remain in full compliance. 

10. FUTURE REVISIONS OF THIS LICENSE 

The Free Software Foundation may publish new, revised versions of the GNU Free Documentation License from time to time. Such new versions will be 
similar in spirit to the present version, but may differ in detail to address new problems or concerns. See http://www.gnu.org/copyleft/. 
Each version of the License is given a distinguishing version number. If the Document specifies that a particular numbered version of this License "or 
any later version" applies to it, you have the option of following the terms and conditions either of that specified version or of any later version that has 
been published (not as a draft) by the Free Software Foundation. If the Document does not specify a version number of this License, you may choose any 
version ever published (not as a draft) by the Free Software Foundation. 

How to use this License for your documents 

To use this License in a document you have written, include a copy of the License in the document and put the following copyright and license notices 
just after the title page: 

Copyright (c) YEAR YOUR NAME. 

Permission is granted to copy, distribute and/or modify this document 

under the terms of the GNU Free Documentation License, Version 1.2 

or any later version published by the Free Software Foundation; 

with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. 

A copy of the license is included in the section entitled "GNU 

Free Documentation License". 
If you have Invariant Sections, Front-Cover Texts and Back-Cover Texts, replace the "with. ..Texts." line with this: 

with the Invariant Sections being LIST THEIR TITLES, with the 

Front-Cover Texts being LIST, and with the Back-Cover Texts being LIST. 
If you have Invariant Sections without Cover Texts, or some other combination of the three, merge those two alternatives to suit the situation. 
If your document contains nontrivial examples of program code, we recommend releasing these examples in parallel under your choice of free software 
license, such as the GNU General Public License, to permit their use in free software. 



